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METHODS AND COMPOSITIONS RELATING TO CD39-LIKE POLYPEPTIDES 
AND NUCLEIC ACIDS 

1. FIELD OF THE INVENTION 

The present invention provides novel polynucleotides and 
proteins encoded by such polynucleotides, along with 
therapeutic, diagnostic and research utilities for these 
polynucleotides and proteins. 



2. BACKGROUND OF THE INVENTION 

Technology aimed at the discovery of protein factors 
(including e.g., cytokines, such as lymphokines, interferons, 
CSFs and inter leukins) has matured rapidly over the past 
decade. The now routine hybridization cloning and expression 
cloning techniques clone novel polynucleotides "directly" in 
the sense that they rely on information directly related to 
the discovered protein (i.e., partial DNA/ amino acid sequence 
of the protein in the case of hybridization cloning; activity 
of the protein in the case of expression cloning) . In 
addition, more recently, "indirect" cloning techniques have 
been utilized, such as signal sequence cloning, which 
isolates DNA sequences based on the presence of a now well- 
recognized secretory leader sequence motif, as well as 
various PCR-based or low stringency hybridization cloning 
techniques, coupled in certain instances with database 
searching, have advanced the state of the art by making 
available large numbers of DNA/amino acid sequences for 
proteins that are known to have biological activity by virtue 
of their secreted nature in the case of leader sequence 
cloning, by virtue of the cell or tissue source in the case 
of PCR-based techniques and/or by virtue of their sequence 
similarity via database searches, it is to these proteins 
and the polynucleotides encoding them that the present 
invention is directed. 



3. SUMMARY OF THE INVENTION 

The compositions of the present invention include novel 
isolated polypeptides, in particular, novel CD-39-like 
polypeptides, isolated polynucleotides encoding such 
5 polypeptides, including recombinant DNA molecules, cloned 
genes or degenerate variants thereof, especially naturally 
occurring variants such as allelic variants, and antibodies 
that specifically recognize one or more epitopes present on 
such polypeptides. 

The compositions of the present invention additionally 
10 include vectors, including expression vectors, containing the 
polynucleotides of the invention, cells genetically 
engineered to contain such polynucleotides and cells 
genetically engineered to express such polynucleotides. 

The isolated polynucleotides of the invention include, 
but are not limited to, a polynucleotide encoding a 
15 polypeptide comprising the amino acid sequence of SEQ ID NO: 2 
(also referred to herein as "CD3 9L2") ; or a polynucleotide 
encoding a polypeptide comprising amino acid residues 72-93, 
147-162, 191-211 OR 217-238 of SEQ ID NO: 2. 

In selected embodiments, such isolated polynucleotides 
of the invention represents a polynucleotide comprising the 
0 nucleotide sequence of SEQ ID NO:l; or a polynucleotide 

comprising nucleotides 232-1599, 445-510, 670-717, 802-864 or 
880-945 of the nucleotide sequence of SEQ ID NO:l. 

The polynucleotides of the present invention also 
include, but are not limited to, a polynucleotide that 
hybridizes to the complement of the nucleotide sequence of 
5 SEQ ID NO:l under highly stringent hybridization conditions; 
a polynucleotide that hybridizes to the complement of the 
nucleotide sequence of SEQ ID N0:1 under moderately stringent 
hybridization conditions; a polynucleotide which is an 
allelic variant of any polynucleotide recited above; a 
polynucleotide which encodes a species homologue of any of 
) the proteins recited above; or a polynucleotide that encodes 
a polypeptide comprising a specific domain or truncation of 
the polypeptide of SEQ ID NO: 2. 
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The isolated polynucleotides of the invention still 
further include, but are not limited to, a polynucleotide - 
encoding a polypeptide comprising the amino acid sequence of 
SEQ ID NO: 4 (also referred to herein as "CD39L3") ; or a 
5 polynucleotide encoding a polypeptide comprising amino acid 
residues 55-76, 132-150, 177-199 or 213-234 of SEQ ID N0:4. 

In selected embodiments, such isolated polynucleotides 
of the invention represents a polynucleotide comprising the 
nucleotide sequence of SEQ ID NO: 3; or a polynucleotide 
comprising nucleotides 83-1669, 245-310, 476-532, 611-679 or 
10 719-784 of the nucleotide sequence of SEQ ID NO: 3. 

The polynucleotides of the present invention also 
include, but are not limited to, a polynucleotide that 
hybridizes to the complement of the nucleotide sequence of 
SEQ ID NO: 3 under highly stringent hybridization conditions; 
a polynucleotide that hybridizes to the complement of the 

15 nucleotide sequence of SEQ ID NO: 3 under moderately stringent 
hybridization conditions; a polynucleotide which is an 
allelic variant of any polynucleotide recited above; a 
polynucleotide which encodes a species homologue of any of 
the proteins recited above; or a polynucleotide that encodes 
a polypeptide comprising a specific domain or truncation of 

20 the polypeptide of SEQ ID NO: 4. 

The isolated polynucleotides of the invention still 
further include, but are not limited to, a polynucleotide 
encoding a polypeptide comprising the amino acid sequence of 
SEQ ID NO: 6 (also referred to herein as "CD39L4") ; or a 
polynucleotide encoding a polypeptide comprising amino acid 

25 residues 47-68, 123-138, 167-187 or 193-214 of SEQ ID N0:6; 
or a polynucleotide encoding a polypeptide comprising the 
amino acid sequence of SEQ ID NO: 9 (also referred to herein 
as dCD39L4") ; or a polynucleotide encoding amino acid 
residues 77-98, 153-167, 197-217 or 223-242 of SEQ ID NO: 9. 

In one embodiment, such isolated polynucleotides of the 

30 invention represents a polynucleotide comprising the 
nucleotide sequence of SEQ ID NO: 5; or a polynucleotide 
comprising nucleotides 247-1530, 385-450, 613-660, 745-807 or 
823-888 of the nucleotide sequence of SEQ ID NO: 5. 
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The polynucleotides of the present invention also 
include, but are not limited to, a polynucleotide that 
hybridizes to the complement of the nucleotide sequence of 
SEQ ID NO: 5 under highly stringent hybridization conditions; 
5 a polynucleotide that hybridizes to the complement of the 

nucleotide sequence of SEQ ID NO: 5 under moderately stringent 
hybridization conditions; a polynucleotide which is an 
allelic variant of any polynucleotide recited above; a 
polynucleotide which encodes a species homologue of any of 
the proteins recited above; or a polynucleotide that encodes 
10 a polypeptide comprising a specific domain or truncation of 
the polypeptide of SEQ ID NO: 6. 

The isolated polynucleotides of the invention further 
include, but are not limited to, a polynucleotide encoding a 
polypeptide comprising the amino acid sequence of SEQ ID NO: 8 
(also referred to herein as "mCD39L4" or "mNTPase") ; or a 
15 polynucleotide encoding amino acid residues 46-67, 122-140, 
166-187 or 194-213 of SEQ ID N0:S. 

In selected embodiments, such isolated polynucleotides 
of the invention represents a polynucleotide comprising the 
nucleotide sequence of SEQ ID NO: 7; or a polynucleotide 
comprising nucleotides 205-1599, 340-395, 568-624, 700-765 or 
20 784-843 of the nucleotide sequence of SEQ ID NO: 7. 

The polynucleotides of the present invention also 
include, but are not limited to, a polynucleotide that 
hybridizes to the complement of the nucleotide sequence of 
SEQ ID NO: 7 under highly stringent hybridization conditions; 
a polynucleotide that hybridizes to the complement of the 
25 nucleotide sequence of SEQ ID NO: 7 under moderately stringent 
hybridization conditions; a polynucleotide which is an 
allelic variant of any polynucleotide recited above; a 
polynucleotide which encodes a species homologue of any of 
the proteins recited above; or a polynucleotide that encodes 
a polypeptide comprising a specific domain or truncation of 
30 the polypeptide of SEQ ID NO: 8. 

The polynucleotides of the invention additionally 
include the complement of any of the polynucleotides recited 
above . 



The isolated polypeptides of the invention further 
include, but are not limited to, a polypeptide comprising the 
amino acid sequence of SEQ ID NO: 2; or a polypeptide 
comprising amino acid residues 72-93, 147-162, 191-211 or 
5 217-238 of SEQ ID NO: 2. 

The isolated polypeptides of the invention still further 
include, but are not limited to, a polypeptide comprising the 
amino acid sequence of SEQ ID NO: 4; or a polypeptide 
comprising amino acid residues 55-76, 132-150, 179-199 or 
213-234 of SEQ ID NO: 4. 
10 The isolated polypeptides of the invention include, but 

are not limited to, a polypeptide comprising the amino acid 
sequence of SEQ ID NO: 6; or a polypeptide comprising amino 
acid residues 47-68, 123-138, 167-187 or 193-214 of SEQ ID 
NO:6. 

The isolated polypeptides of the invention include, but 
15 are not limited to, a polypeptide comprising the amino acid 
sequence of SEQ ID NO: 8; or a polypeptide comprising amino 
acid residues 46-67, 122-140, 166-187 or 194-213 of SEQ ID 
NO: 8. 

The isolated polypeptides of the invention include, but 
are not limited to, a polypeptide comprising the amino acid 
20 sequence of SEQ ID NO: 9; or a polypeptide comprising amino 
acid residues 77-98, 153-167, 197-217 or 223-242 of SEQ ID 
NO: 9. 

Preferred embodiments include polypeptides that 
represent is mature forms of the polypeptides of the 
invention. 

25 Polypeptide compositions of the present invention may 

further comprise an acceptable carrier, such as a 
hydrophilic, e.g. , pharmaceutical^ acceptable, carrier. 

The invention also relates to methods for producing a 
polypeptide comprising growing a culture of the cells of the 
invention in a suitable culture medium, and purifying the 

30 protein from the culture or from an extract of the cells. 

Polynucleotides according to the invention have numerous 
applications in a variety of techniques known to those 
skilled in the art of molecular biology. These techniques 
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include use as hybridization probes, use as primers for PCR, 
use for chromosome and gene mapping, use in the recombinant 
production of protein, and use in generation of anti-sense 
DNA or RNA, their chemical analogs and the like. For 

5 example, when the expression of an mRNA is largely restricted 
to a particular cell or tissue type, polynucleotides of the 
invention can be used as hybridization probes to' detect the 
presence of the particular cell or tissue mRNA in a sample 
using, e.g., in situ hybridization. 

In other exemplary embodiments, the polynucleotides are 

10 used in diagnostics as expressed sequence tags for 

identifying expressed genes or, as well known in the art and 
exemplified by Vollrath et al., Science 258 ; 52-59 (1992), as 
expressed sequence tags for physical mapping of the human 
genome . 

The polypeptides according to the invention can be used 

15 in a variety of conventional procedures and methods that are 
currently applied to other proteins. For example, the 
polypeptides of the invention can be used as molecular weight 
markers, and as a food supplement. In addition, a 
polypeptide of the invention can be used to generate an 
antibody that specifically binds the polypeptide. 

20 Methods are also provided for preventing, treating or 

ameliorating a medical condition which comprises 
administering to a mammalian subject a therapeutically 
effective amount of a composition comprising a protein of the 
present invention and a pharmaceutically acceptable carrier. 
The polypeptides and polynucleotides of the invention 

25 can be utilized, for example, as part of methods for 

modulating ecto-ATPase activity and for identifying compounds 
that can be utilized as part of methods for modulating ecto- 
ATPase activity. Among the processes that can be modulated 
via such methods are processes involved in cell adhesion, 
apoptosis, vesicular transport, signalling, including 

30 purinergic, synaptic and neurotransmitter signalling, and 
purine recycling. The polypeptides of the invention having 
ADPase activity are also useful as anticoagulants and for 
inhibiting platelet aggregation. The polypeptides of the 



invention can further be utilized as anti-thrombotic agents, 
anti-tissue graft rejection agents, and/or as part of methods 
for regulating neurotransmission by ATP in smooth muscle, 
peripheral ganglia or the brain. 
5 The methods of the present invention further relate to 

methods for detecting the presence of the polynucleotides or 
polypeptides of the invention in a sample. Such methods can, 
for example, be utilized as part of prognostic and diagnostic 
evaluation of disorders as recited above and for the 
identification of subjects exhibiting a predisposition to 
10 such conditions. Furthermore, the invention provides methods 
for evaluating the efficacy of drugs, and monitoring the 
progress of patients, involved in clinical trials for the 
treatment of disorders as recited above. 

The invention also provides methods for the 
identification of compounds that modulate the expression of 
15 the polynucleotides and/or polypeptides of the invention. 
Such methods can be utilized, for example, for the 
identification of compounds that can ameliorate symptoms of 
disorders as recited above. Such methods can include, but 
are not limited to, assays for identifying compounds and 
other substances that interact with (e.g., bind to) the 
20 polypeptides of the invention. 

The methods of the invention also include methods for 
the treatment of disorders as recited above which may involve 
the administration of such compounds to individuals 
exhibiting symptoms or tendencies related to disorders as 
recited above. In addition, the invention encompasses 
25 methods for treating diseases or disorders as recited above 
by administering compounds and other substances that modulate 
the overall activity of the target gene products. Compounds 
and other substances can effect such modulation either on the 
level of target gene expression or target protein activity. 

30 4. BRIEF DESCRIPTION OF THE FIGURES 

Figure 1. Top line: Nucleotide sequence of SEQ ID 
NO:7, referred to herein as mNTPase or mCD39L4; bottom line: 
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amino acid sequence of SEQ ID NO: 8, referred to herein as 
mNTPase or mCD39L4. 

Figure 2. Amino acid alignments of the full mNTPase 
5 (mCD39L4) amino acid sequence (SEQ ID NO: 8) and the most 
closely related other NTPase proteins: garden pea NTpase 
(SEQ ID NO: 10), potato apyrase (SEQ ID NO: 11), yeast GDPase 
(SEQ ID NO: 12). Identical residues are indicated by dark 
background (white letters) , while conserved residues are 
indicated by gray background. Alignments were made with 
10 pileup and boxshade from the Wisconsin Package Version 9.0, 
Genetics Computer Group (GCG) , Madison WI . 

Figure 3. Alignment of 12 members of the NTPase (or 
CD39-like) gene family indicating the conserved apryase 
regions I-IV. CD39=human (from Accession No. S73813; SEQ ID 

15 NO:13), ratCD39=rat (from Accession No. gill754710; SEQ ID 
NO:14), CD39Ll=human (Accession No. U91510; SEQ ID NO:15), 
ChickATPase=chicken (from Accession No. U74467; SEQ ID 
NO: 16), peaNTPase=garden pea (from Accession No. P52914; SEQ 
ID NO: 10), potRROPl=potato RROP1 gene (from Accession No. 
gill381633; SEQ ID NO: 11), yGDAl+y71KD=yeast genes (from 

20 Accession Nos. splP32621 + splP40009; SEQ ID NO:12), 

hCD39L2=CD39L2, celegans=c. Elegans gene (from Accession No. 
gill086594; SEQ ID NO: 17). Identical residues are indicated 
by date background (white letters) , conserved residues are 
indicated by gray background. Alignments were made with 
pileup and boxshade from the Wisconsin Package 9.0, Genetics 

25 computer Group (GCG), Madison, WI. Conserved portions of 
ACRs I-IV are boxed. 

Figure 4. Top line: Nucleotide sequence of SEQ ID 
N0:l, referred to herein as CD39L2; bottom line: amino acid 
sequence of SEQ ID NO: 2, referred to herein as CD39L2. 

30 

Figure 5. Comparison of the hydrophobicity predictions 
for the amino acid sequences of members of the human CD39- 
like gene family. Predictions were made using the Topred-II 



1.1 program (Claros, M.G. & Von Hejine, G. , 1994, Comput. 
Appl. Biosci. 10:685-686; putative setting=0.5; certain 
setting=l . 0 ) . 

5 Figure 6. Top line: Nucleotide sequence of SEQ ID 

NO: 3, referred to herein as CD39L3; bottom line: amino acid 
sequence of SEQ ID NO: 4, referred to herein as CD39L3. 

Figure 7. Top line: Nucleotide sequence of SEQ ID 
NO: 5, referred to herein as CD39L4; bottom line: amino acid 
10 sequence of SEQ ID NO: 6, referred to herein as CD39L4. 

Figure 8. Amino acid alignments of the full-length 
protein sequences for human members of the CD39-like gene 
family. CD39 (from Accession No. S73813; SEQ ID NO:13), 
CD39L1 (from Accession No. U91510; SEQ ID NO:15), CD39L2 (it 

15 is noted that the CD39L2 polypeptide illustrated here depicts 
a derived amino acid sequence that is encoded from the ATG 
codon beginning at nucleotide 14 8 (see FIG. 4) and, 
therefore, includes an additional 28 amino acid residues N- 
terminal to those depicted in FIG. 4; this form of CD39L2 is 
also intended to be included within the scope of the present 

20 invention), CD39L3, CD39L4. 

Identical residues are indicated by a black background 
(white letters) , and conserved residues are indicated by a 
gray background. Spaces in the sequences are indicated by a 
dot. Apyrase regions (ACRs) are indicated by arrows, with 
conserved portions of ACRs I-IV are highlighted by the boxed 

25 sections. Alignments were made using pileup and boxshade 
from the Wisconsin Package Version 9.0 Genetics Computer 
Group (GCG) , Madison, WI. 

Figure 9. Amino acid sequence of dCD39L4 ("dNTPase"; SEQ 
ID NO: 9) and alignment of the amino acid sequence with the 
30 most closely related members of the CD39-like gene family. 
peaGDP, garden pea NTPase (from Accession No. P52194; SEQ ID 
NO: 10); ptoapyrase, potato RROP1 gene (from Accession No. 
gill381633; SEQ ID NO:ll); CD39L2; CD39L4, and yGDPase, yeast 
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yGDAl gene (from Accession No. splP32621; SEQ ID NO: 12). 
Apyrase regions (ACRs) are indicated by arrows, with 
conserved portions of ACRs I-IV are highlighted by the boxed 
sections . 

5. DETAILED DESCRIPTION OF THE INVENTION 
5.1. Definitions 

The term "nucleotide sequence" refers to a heteropolymer 
of nucleotides or the sequence of these nucleotides. The 
terms "nucleic acid" and "polynucleotide" are also used 
interchangeably herein to refer to a heteropolymer of 
nucleotides. Generally, nucleic acid segments provided by 
this invention may be assembled from fragments of the genome 
and short oligonucleotide linkers, or from a series of 
oligonucleotides, or from individual nucleotides, to provide 
a synthetic nucleic acid which is capable of being expressed 
in a recombinant transcriptional unit comprising regulatory 
elements derived from a microbial or viral operon, or a 
eukaryotic gene. In alternate embodiments, a nucleotide 
sequence, polynucleotide or nucleic acid can correspond to a 
genomic sequence f e.q. . can contain intron as well as exon 
sequence) or cDNA sequences (that is, contains no intron 
sequence) . 

The terms "oligonucleotide fragment" or a 
"polynucleotide fragment", "portion," or "segment" is a 
stretch of polypeptide nucleotide residues which is long 
enough to use in polymerase chain reaction (PCR) or various 
hybridization procedures to identify or amplify identical or 
related parts of mRNA or DNA molecules. 

The terms "oligonucleotides" or "nucleic acid probes" 
are prepared based on the polynucleotide sequences provided 
in the present invention. Oligonucleotides comprise portions 
of such a polynucleotide sequence having at least about 15 
nucleotides and usually at least about 20 nucleotides. 
Nucleic acid probes comprise portions of such a 
polynucleotide sequence having fewer nucleotides than about 6 
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kb, usually fewer than about 1 kb. After appropriate testing 
to eliminate false positives, these probes may, for example, 
be used to determine whether specific mRNA molecules are 
present in a cell or tissue or to isolate similar nucleic 
5 acid sequences from chromosomal DNA as described by Walsh et 
al. (Walsh, P.S. et al., 1992, PCR Methods Appl 1:241-250). 

The term "probes" includes naturally occurring or 
recombinant or chemically synthesized single- or double- 
stranded nucleic acids. They may be labeled by nick 
translation, Klenow fill-in reaction, PCR or other methods 
10 well known in the art. Probes of the present invention, 

their preparation and/or labeling are elaborated in Sambrook, 
J. et al., 1989, Molecular Cloning: A Laboratory Manual, 
Cold Spring Harbor Laboratory, NY; or Ausubel, F.M. et al., 
1989, Current Protocols in Molecular Biology, John Wiley & 
Sons, New York NY, both of which are incorporated herein by 
15 reference in their entirety. 

The "oligonucleotide fragment," "polynucleotide 
fragment," "portion," "segment,", "oligonucleotide" or 
"nucleic acid probe" is at least about 15, and preferably at 
least about 50, 100, 200, 300, 400, 500, 600, 700 or 800 
nucleotides in length. 
20 The term "stringent" is used to refer to conditions that 

are commonly understood in the art as stringent. Stringent 
conditions can include highly stringent conditions (i.e., 
hybridization to filter-bound DNA under in 0.5 M NaHP0 4 , 7% 
sodium dodecyl sulfate (SDS) , 1 mM EDTA at 65° C, and washing 
in 0.1xSSC/0.1% SDS at 68° C) , and moderately stringent 
25 conditions (i.e., washing in 0.2xSSC/0.1% SDS at 42° C) . 
In instances wherein hybridization of 
deoxyoligonucleotides is concerned, additional exemplary 
highly stringent hybridization conditions include washing in 
6xSSC/0.05% sodium pyrophosphate at 37°C (for 14-base 
oligos) , 48°C (for 17-base oligos) , 55°C (for 20-base 
30 oligos) , and 60°C (for 23-base oligos) . 

The term "recombinant," when used herein to refer to a 
polypeptide or protein, means that a polypeptide or protein 
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is derived from recombinant (e.g., microbial or mammalian) 
expression systems. "Microbial" refers to recombinant 
polypeptides or proteins made in bacterial or fungal (e.g., 
yeast) expression systems. As a product, "recombinant 

5 microbial" defines a polypeptide or protein essentially free 
of native endogenous substances and unaccompanied by 
associated native glycosylation. Polypeptides or proteins 
expressed in most bacterial cultures, e.g. , jSLs. coli . will be 
free of glycosylation modifications; polypeptides or proteins 
expressed in yeast will have a glycosylation pattern in 

10 general different from those expressed in mammalian cells. 
The term "recombinant expression vehicle or vector" 
refers to a plasmid or phage or virus or vector, for 
expressing a polypeptide from a DNA (RNA) sequence. An 
expression vehicle can comprise a transcriptional unit 
comprising an assembly of (1) a genetic element or elements 

15 having a regulatory role in gene expression, for example, 
promoters or enhancers, (2) a structural or coding sequence 
which is transcribed into mRNA and translated into protein, 
and (3) appropriate transcription initiation and termination 
sequences. Structural units intended for use in yeast or 
eukaryotic expression systems preferably include a leader 

20 sequence enabling extracellular secretion of translated 
protein by a host cell. Alternatively, where recombinant 
protein is expressed without a leader or transport sequence, 
it may include an N-terminal methionine residue. This 
residue may or may not be subsequently cleaved from the 
expressed recombinant protein to provide a final product. 

25 The term "recombinant expression system" means host 

cells which have stably integrated a recombinant 
transcriptional unit into chromosomal DNA or carry the 
recombinant transcriptional unit extrachromosomally . 
Recombinant expression systems as defined herein will express 
heterologous polypeptides or proteins upon induction of the 

30 regulatory elements linked to the DNA segment or synthetic 
gene to be expressed. This term also means host cells which 
have stably integrated a recombinant genetic element or 
elements having a regulatory role in gene expression, for 
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example, promoters or enhancers. Recombinant expression 
systems as defined herein will express polypeptides or 
proteins endogenous to the cell upon induction of the 
regulatory elements linked to the endogenous DNA segment or 
5 gene to be expressed. The cells can be prokaryotic or 
eukaryotic. 

The term "open reading frame," ORF, means a series of 
nucleotide triplets coding for amino acids without any 
termination codons and is a sequence translatable into 
protein. 

10 The term "expression modulating fragment," EMF, means a 

series of nucleotides which modulates the expression of an 
operably linked ORF or another EMF. 

As used herein, a sequence is said to "modulate the 
expression of an operably linked sequence" when the 
expression of the sequence is altered by the presence of the 

15 EMF. EMFs include, but are not limited to, promoters, and 
promoter modulating sequences (inducible elements). One 
class of EMFs are fragments which induce the expression or an 
operably linked ORF in response to a specific regulatory 
factor or physiological event. 

As used herein, an "uptake modulating fragment," UMF, 

20 means a series of nucleotides which mediate the uptake of a 
linked DNA fragment into a cell. UMFs can be readily 
identified using known UMFs as a target sequence or target 
motif with the computer-based systems described below. 

The presence and activity of a UMF can be confirmed by 
attaching the suspected UMF to a marker sequence. The 

25 resulting nucleic acid molecule is then incubated with an 
appropriate host under appropriate conditions and the uptake 
of the marker sequence is determined. As described above, a 
UMF will increase the frequency of uptake of a linked marker 
sequence. 

The term "active" refers to those forms of the 
30 polypeptide which retain the biological and/ or immunological 
activities of any naturally occurring polypeptide. 

The term "biologically active" refers to the biological 
activity of a naturally occurring polypeptide as well as to 



the ability of the polypeptide to exhibit an immunological 
activity. A polypeptide exhibits an "immunological activity" 
when antibodies can be generated that are directed against 
the polypeptide. 

5 The term "naturally occurring polypeptide" refers to 

polypeptides produced by cells that have not been genetically 
engineered and specifically contemplates various polypeptides 
arising from post-trans lational modifications of the 
polypeptide including, but not limited to, acetylation, 
carboxylation, glycosylation, phosphorylation, lipidation and 

10 acylation. 

The term "mature" refers to a polypeptide that has been 
postranslationally modified or that corresponds in primary 
amino acid sequence to a polypeptide that has been 
postranslationally modified. A mature polypeptide includes, 
but is not limited to, a polypeptide which comprises a 

15 primary amino acid sequence that has been processed from a 
"pre-," "pro-," or "pre-pro" amino acid sequence; a 
polypeptide which comprises a primary amino acid sequence 
corresponding to that of a polypeptide that has been 
processed from a "pre-," "pro-," or "pre-pro" amino acid 
sequence; a polypeptide that has been post-translationally 

20 modified via such modifications as, for example, acetylation, 
carboxylation, glycosylation, phosphorylation, lipidation and 
acylation. 

The term "derivative" refers to polypeptides chemically 
modified by such techniques as ubiquitination, labeling 
(e.g., with radionuclides or various enzymes), pegylation 

25 (derivatization with polyethylene glycol) and insertion or 
substitution by chemical synthesis of amino acids such as 
ornithine, which do not normally occur in human proteins. 

The term "recombinant variant" refers to any polypeptide 
differing from naturally occurring polypeptides by amino acid 
insertions, deletions, and substitutions, created using 

30 recombinant DNA techniques. Guidance in determining which 
amino acid residues may be replaced, added or deleted without 
abolishing activities of interest, such as cellular 
trafficking, may be found by comparing the sequence of the 
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particular polypeptide with that of homologous peptides and 
minimizing the number of amino acid sequence changes made in 
regions of high homology. 

Preferably, amino acid "substitutions" are the result of 
5 replacing one amino acid with another amino acid having 
similar structural and/or chemical properties, i.e., 
conservative amino acid replacements. Amino acid 
substitutions may be made on the basis of similarity in 
polarity, charge, solubility, hydrophobicity, hydrophilicity, 
and/or the amphipathic nature of the residues involved. For 

10 example, nonpolar (hydrophobic) amino acids include alanine, 
leucine, isoleucine, valine, proline, phenylalanine, 
tryptophan, and methionine; polar neutral amino acids include 
glycine, serine, threonine, cysteine, tyrosine, asparagine, 
and glutamine; positively charged (basic) amino acids include 
arginine, lysine, and histidine; and negatively charged 

15 (acidic) amino acids include aspartic acid and glutamic acid. 

"Insertions" or "deletions" are typically in the range 
of about 1 to 5 amino acids. The variation allowed may be 
experimentally determined by systematically making 
insertions, deletions, or substitutions of amino acids in a 
polypeptide molecule using recombinant DNA techniques and 

20 assaying the resulting recombinant variants for activity. 

Alternatively, where alteration of function is desired, 
insertions, deletions or non-conservative alterations can be 
engineered to produce altered polypeptides. Such alterations 
can, for example, alter one or more of the biological 
functions or biochemical characteristics of the polypeptides 

25 of the invention. For example, such alterations may change 
polypeptide characteristics such as ligand-binding 
affinities, interchain affinities, or degradation/turnover 
rate. Further, such alterations can be selected so as to 
generate polypeptides that are better suited for expression, 
scale up and the like in the host cells chosen for 

30 expression. For example, cysteine residues can be deleted or 
substituted with another amino acid residue in order to 
eliminate disulfide bridges. 
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As used herein, "substantially equivalent" can refer 
both to nucleotide and amino acid sequences, for example a 
mutant sequence, that varies from a reference sequence by one 
or more substitutions, deletions, or additions, the net 
5 effect of which does not result in an adverse functional 
dissimilarity between the reference and subject sequences. 
Typically, such a substantially equivalent sequence varies 
from one of those listed herein by no more than about 20%, 
i-e- , the number of individual residue substitutions, 
additions, and/or deletions in a substantially equivalent 

10 sequence, as compared to the corresponding reference 

sequence, divided by the total number of residues in the 
substantially equivalent sequence is about 0.2 or less. 
Such a sequence is said to have 80% sequence identity to the 
listed sequence. Such a substantially equivalent sequence 
can be routinely identified by applying the foregoing 

15 algorithm. 

In one embodiment, a substantially equivalent, e.g. , 
mutant, sequence of the invention varies from a listed 
sequence by no more than 10%, i.e. . the number of individual 
residue substitutions, additions, and/or deletions in a 
substantially equivalent sequence, as compared to the 

20 corresponding reference sequence, divided by the total number 
of residues in the substantially equivalent sequence is about 
0.1 or less. Such a sequence is said to have 90% sequence 
identity to the listed sequence. Such a substantially 
equivalent sequence can be routinely identified by applying 
the foregoing algorithm. 

25 In an alternate embodiment a substantially equivalent 

sequence of the invention varies from a listed sequence by no 
more than by no more than 5%, i.e. f the number of individual 
residue substitutions, additions, and/or deletions in a 
substantially equivalent sequence, as compared to the 
corresponding reference sequence, divided by the total number 

30 of residues in the substantially equivalent sequence is about 
0.05 or less. Such a sequence is said to have 95% sequence 
identity to the listed sequence. Such a substantially 
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equivalent sequence can be routinely identified by applying 
the foregoing algorithm. 

In yet another alternate embodiment, a substantially 
equivalent sequence of the invention varies from a listed 
5 sequences by no more than 2%, i.e. , the number of individual 
residue substitutions, additions, and/or deletions in a 
substantially equivalent sequence, as compared to the 
corresponding reference sequence, divided by the total number 
of residues in the substantially equivalent sequence is about 
0.02 or less. Such a sequence is said to have 98% sequence 
10 identity to the listed sequence. Such a substantially 

equivalent sequence can be routinely identified by applying 
the foregoing algorithm. 

Substantially equivalent, e.g. f mutant, amino acid 
sequences according to the invention generally have at least 
95% sequence identity with a listed amino acid sequence, 
15 whereas substantially equivalent nucleotide sequence of the 
invention can have lower percent sequence identities, taking 
into account, for example, the redundancy or degeneracy of 
the genetic code. For the purposes of the present invention, 
sequences having substantially equivalent biological activity 
and substantially equivalent expression characteristics are 
20 considered substantially equivalent. In a preferred 

embodiment, for the purposes of determining equivalence, 
truncation of the mature sequence {e.g., via a mutation which 
creates a spurious stop codon) are disregarded. 

Nucleic acid sequences encoding such substantially 
equivalent sequences, e.g. , sequences of the recited percent 
25 identities, can also routinely be isolated and identified via 
standard hybridization procedures well known to those of 
skill in the art. 

Where desired, an expression vector may be designed to 
contain a "signal or leader sequence" which will direct the 
polypeptide through the membrane of a cell. Such a sequence 
30 may be naturally present on the polypeptides of the present 
invention or provided from heterologous protein sources by 
recombinant DNA techniques. 
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A polypeptide "fragment," "portion," or "segment" is a 
stretch of amino acid residues of at least about 5 amino 
acids, often at least about 7 amino acids or about at least 
about 9 to 13 amino acids, and, in various embodiments, at 
5 least about 17, 25, 50, 75, 100, 150, 200, 300, 400 or more 
amino acids. To be "active," any polypeptide must have 
sufficient length to display biologic and/or immunologic 
activity. 

Recombinant variants encoding these same or similar 
polypeptides may be synthesized or selected by making use of 
10 the "redundancy" in the genetic code. Various codon 
substitutions, such as the silent changes which produce 
various restriction sites, may be introduced to optimize 
cloning into a plasmid or viral vector or expression in a 
particular prokaryotic or eukaryotic system. Mutations in 
the polynucleotide sequence may be reflected in the 
15 polypeptide or domains of other peptides added to the 
polypeptide to modify the properties of any part of the 
polypeptide, to change characteristics such as ligand-binding 
affinities, interchain affinities, or degradation/ turnover 
rate. Such variant nucleic acids and polypeptides are to be 
considered part of the present invention. 

20 The term "activated" cells as used herein refers to 

those cells that are engaged in extracellular or 
intracellular membrane trafficking, including the export of 
neurosecretory or enzymatic molecules as part of a normal or 
disease process. 

The term "purified" as used herein denotes that the 

25 indicated nucleic acid or polypeptide is present in the 

substantial absence of other biological macromolecules, e.g., 
polynucleotides, proteins, and the like. In one embodiment, 
the polynucleotide or polypeptide is purified such that it 
constitutes at least 95% by weight, more preferably at least 
99.8% by weight, of the indicated biological macromolecules 

30 present (but water, buffers, and other small molecules, 

especially molecules having a molecular weight of less than 
1000 daltons, can be present) . 
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The term "isolated" as used herein refers to a nucleic 
acid or polypeptide separated from at least one other 
component (e.g., nucleic acid or polypeptide) present with 
the nucleic acid or polypeptide in its natural source. In 
5 one embodiment, the nucleic acid or polypeptide is found in 
the presence of (if anything) only a solvent, buffer, ion, or 
other component normally present in a solution of the same. 
The terms "isolated" and "purified" do not encompass nucleic 
acids or polypeptides present in their natural source. 

The term "infection" refers to the introduction of 
10 nucleic acids into a suitable host cell by use of a virus or 
viral vector. 

The term "transformation" means introducing DNA into a 
suitable host cell so that the DNA is replicable, either as 
an extrachromosomal element, or by chromosomal integration. 
The term "transf ection" refers to the taking up of an 

15 expression vector by a suitable host cell, whether or not any 
coding sequences are in fact expressed. 

The term "intermediate fragment" means a nucleic acid 
between about 5 and about 1000 bases in length. In various 
embodiments, such nucleic acids are between about 10 and 
about 40 bp in length, or at least about 100, 2 00, 3 00, 400, 

20 500, 600, 700, 800 or 900 bp in length. 

The term "secreted" protein refers to a protein that is 
transported across or through a membrane, including transport 
as a result of signal sequences in its amino acid sequence 
when it is expressed in a suitable host cell. "Secreted" 
proteins include without limitation proteins secreted wholly 

25 (e.g., soluble proteins) or partially (e.g., receptors, 
including seven-transmembrane receptors) from the cell in 
which they are expressed. "Secreted" proteins also include 
without limitation proteins which are transported across the 
membrane of the endoplasmic reticulum. Each of the above 
terms is meant to encompasses all that is described for each, 

30 unless the context dictates otherwise. 



POLYNUCLEOTIDES AND NUCLEIC ACIDS OF THE INVENTION 



Nucleotide and amino acid sequences of the invention are 
reported below. Fragments of the proteins of the present 
invention which are capable of exhibiting biological activity 
are also encompassed by the present invention. Fragments of 
5 the protein may be in linear form or they may be cyclized 
using known methods, for example, as described in H. U. 
Saragovi, et al., Bio/Technology 10, 773-778 (1992) and in R. 
S. McDowell, et al. , J. Amer. Chem. Soc. 114, 9245-9253 
(1992), both of which are incorporated herein by reference. 
Such fragments may be fused to carrier molecules such as 

10 immunoglobulins for many purposes, including increasing the 
valency of protein binding sites. For example, fragments of 
the protein may be fused through "linker" sequences to the Fc 
portion of an immunoglobulin. For a bivalent form of the 
protein, such a fusion could be to the Fc portion of an IgG 
molecule. Other immunoglobulin isotypes may also be used to 

15 generate such fusions. For example, a protein — IgM fusion 
would generate a decavalent form of the protein of the 
invention. 

The present invention also provides both full-length and 
mature forms of the disclosed proteins. The full-length forms 
of the polypeptides of the invention are identified in the 

20 figures and the sequence listing by translation of the 
nucleotide sequence of each nucleic acid molecule. Mature 
forms of the polypeptides of the invention can routinely be 
be obtained by expression of the disclosed nucleotides 
encoding the full-length polypeptides in a suitable mammalian 
cell or other host cell. The sequence of the mature forms of 

25 the polypeptides can also routinely be determined from the 
amino acid sequence of the full-length polypeptides. 

The present invention also provides genes corresponding 
to cDNA sequences disclosed herein. The corresponding genes 
can be isolated in accordance with known methods using the 
sequence information disclosed herein. Such methods include 

30 the preparation of probes or primers from the disclosed 

sequence information for identification and/or amplification 
of genes in appropriate genomic libraries or other sources of 
genomic materials. 
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Where the protein of the present invention is membrane- 
bound (e.g., is a receptor), the present invention also 
provides for soluble forms of such protein. In such forms 
part or all of the intracellular and transmembrane domains of 
5 the protein are deleted such that the protein is fully 
secreted from the cell in which it is expressed. The 
intracellular and transmembrane domains of proteins of the 
invention can be identified in accordance with known 
techniques for determination of such domains from sequence 
information. 

10 Species homologs of the disclosed polynucleotides and 

proteins are also provided by the present invention. Species 
homologs may be isolated and identified by making suitable 
probes or primers from the sequences provided herein and 
screening a suitable nucleic acid source from the desired 
species. Species homologs can include, but are not limited 
15 to human, murine, rat or Drosophila species homologs. 

The invention also encompasses allelic variants of the 
disclosed polynucleotides or proteins; that is, naturally- 
occurring alternative forms of the isolated polynucleotide 
which also encode proteins which are identical, homologous or 
related to that encoded by the polynucleotides. Sequences 
20 and allelic variant sequences of the invention can include, 
but are not limited to human, murine, rat and Drosophila 
sequences. 

The compositions of the present invention include 
isolated polynucleotides, including recombinant DNA 
molecules, cloned genes or degenerate variants thereof, 
25 especially naturally occurring variants such as allelic 
variants, novel isolated polypeptides, and antibodies that 
specifically recognize one or more epitopes present on such 
polypeptides . 



5.2. Nucleic acids of the Invention 

The isolated polynucleotides of the invention include, 
but are not limited to, a polynucleotide encoding a 



-21- 



NY2 - 924563. 1 



polypeptide comprising the amino acid sequence of SEQ ID NO: 2 
(also referred to herein as "CD39L2"); or a polynucleotide 
encoding a polypeptide comprising amino acid residues 72-93, 
147-162, 191-211 OR 217-238 of SEQ ID NO: 2. 
5 In selected embodiments, such isolated polynucleotides 

of the invention represents a polynucleotide comprising the 
nucleotide sequence of SEQ ID NO:l; or a polynucleotide 
comprising nucleotides 232-1599, 445-510, 670-717, 802-864 or 
880-945 of the nucleotide sequence of SEQ ID NO:l. 

The polynucleotides of the present invention also 

10 include, but are not limited to, a polynucleotide that 

hybridizes to the complement of the nucleotide sequence of 
SEQ ID NO:l under highly stringent hybridization conditions; 
a polynucleotide that hybridizes to the complement of the 
nucleotide sequence of SEQ ID NO:l under moderately stringent 
hybridization conditions; a polynucleotide which is an 

15 allelic variant of any polynucleotide recited above; a 

polynucleotide which encodes a species homologue of any of 
the proteins recited above; or a polynucleotide that encodes 
a polypeptide comprising a specific domain or truncation of 
the polypeptide of SEQ ID NO: 2. Such polynucleotides 
hybridize under the above conditions to the complement of SEQ 

20 ID NO:l or to a fragment of SEQ ID NO:l, wherein the fragment 
is greater than at least about 10 bp, and, in alternate 
embodiments, is about 20 to about 50 bp, or is greater than 
about 100 bp, 200 bp, 300 bp, 400 bp, 500 bp, 600 bp, 700 bp, 
or 800 bp. 

The isolated polynucleotides of the invention still 
25 further include, but are not limited to, a polynucleotide 
encoding a polypeptide comprising the amino acid sequence of 
SEQ ID NO: 4 (also referred to herein as "CD39L3") ; or a 
polynucleotide encoding a polypeptide comprising amino acid 
residues 55-76, 132-150, 179-199 or 213-234 of SEQ ID NO: 4. 
In selected embodiments, such isolated polynucleotides 
30 of the invention represents a polynucleotide comprising the 
nucleotide sequence of SEQ ID NO: 3; or a polynucleotide 
comprising nucleotides 83-1669, 245-310, 476-532, 611-679 or 
719-784 of the nucleotide sequence of SEQ ID NO: 3. 
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The polynucleotides of the present invention also 
include, but are not limited to, a polynucleotide that 
hybridizes to the complement of the nucleotide sequence of 
SEQ ID NO: 3 under highly stringent hybridization conditions; 

5 a polynucleotide that hybridizes to the complement of the 

nucleotide sequence of SEQ ID NO: 3 under moderately stringent 
hybridization conditions; a polynucleotide which is an 
allelic variant of any polynucleotide recited above; a 
polynucleotide which encodes a species homologue of any of 
the proteins recited above; or a polynucleotide that encodes 

10 a polypeptide comprising a specific domain or truncation of 
the polypeptide of SEQ ID NO: 4. Such polynucleotides 
hybridize under the above conditions to the complement of SEQ 
ID NO: 3 or to a fragment of SEQ ID NO: 3, wherein the fragment 
is greater than at least about 10 bp, and, in alternate 
embodiments, is about 20 to about 50 bp, or is greater than 

15 about 100 bp, 200 bp, 300 bp, 400 bp, 500 bp, 600 bp, 700 bp, 
or 800 bp. 

The isolated polynucleotides of the invention still 
further include, but are not limited to, a polynucleotide 
encoding a polypeptide comprising the amino acid sequence of 
SEQ ID NO: 6 (also referred to herein as "CD39L4") ; or a 

20 polynucleotide encoding a polypeptide comprising amino acid 
residues 47-68, 123-138, 167-187 or 193-214 of SEQ ID NO: 6; 
or a polynucleotide encoding a polypeptide comprising the 
amino acid sequence of SEQ ID NO: 9 (also referred to herein 
as dCD39L4") ; or a polynucleotide encoding amino acid 
residues 77-98, 153-167, 197-217 or 223-242 of SEQ ID NO: 9. 

25 In one embodiment, such isolated polynucleotides of the 

invention represents a polynucleotide comprising the 
nucleotide sequence of SEQ ID NO: 5; or a polynucleotide 
comprising nucleotides 247-1530, 385-450, 613-660, 745-807 or 
823-888 of the nucleotide sequence of SEQ ID NO: 5. 

The polynucleotides of the present invention also 

30 include, but are not limited to, a polynucleotide that 

hybridizes to the complement of the nucleotide sequence of 
SEQ ID NO: 5 under highly stringent hybridization conditions; 
a polynucleotide that hybridizes to the complement of the 
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nucleotide sequence of SEQ ID NO: 5 under moderately stringent 
hybridization conditions; a polynucleotide which is an 
allelic variant of any polynucleotide recited above; a 
polynucleotide which encodes a species homologue of any of 

5 the proteins recited above; or a polynucleotide that encodes 
a polypeptide comprising a specific domain or truncation of 
the polypeptide of SEQ ID NO: 6. Such polynucleotides 
hybridize under the above conditions to the complement of SEQ 
ID NO: 5 or to a fragment of SEQ ID NO: 5, wherein the fragment 
is greater than at least about 10 bp, and,, in alternate 

10 embodiments, is about 20 to about 50 bp, or is greater than 
about 100 bp, 200 bp, 300 bp, 400 bp, 500 bp, 600 bp, 700 bp 
or 800 bp. 

The isolated polynucleotides of the invention further 
include, but are not limited to, a polynucleotide encoding a 
polypeptide comprising the amino acid sequence of SEQ ID NO: 8 

15 (also referred to herein as "mCD39L4" or "mNTPase" ) ; or a 
polynucleotide encoding amino acid residues 46-67, 122-140, 
166-187 or 194-213 of SEQ ID N0:8. 

In selected embodiments, such isolated polynucleotides 
of the invention represents a polynucleotide comprising the 
nucleotide sequence of SEQ ID NO: 7; or a polynucleotide 

20 comprising nucleotides 205-1599, 340-395, 568-624, 700-765 or 
784-843 of the nucleotide sequence of SEQ ID NO: 7. 

The polynucleotides of the present invention also 
include, but are not limited to, a polynucleotide that 
hybridizes to the complement of the nucleotide sequence of 
SEQ ID NO: 7 under highly stringent hybridization conditions; 

25 a polynucleotide that hybridizes to the complement of the 

nucleotide sequence of SEQ ID NO: 7 under moderately stringent 
hybridization conditions; a polynucleotide which is an 
allelic variant of any polynucleotide recited above; a 
polynucleotide which encodes a species homologue of any of 
the proteins recited above; or a polynucleotide that encodes 

30 a polypeptide comprising a specific domain or truncation of 
the polypeptide of SEQ ID NO: 8. Such polynucleotides 
hybridize under the above conditions to the complement of SEQ 
ID NO: 7 or to a fragment of SEQ ID NO: 7, wherein the fragment 
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is greater than at least about 10 bp, and, in alternate 
embodiments, is about 20 to about 50 bp, or is greater than 
about 100 bp, 200 bp, 300 bp, 400 bp, 500 bp, 600 bp, 700 bp 
or 800 bp. 

5 The polynucleotides of the invention additionally 

include the complement of any of the polynucleotides recited 
above . 

The polynucleotides of the invention also provide 
polynucleotides that are substantially equivalent to the 
polynucleotides recited above. Typically, such a 

10 substantially equivalent sequence varies from one of those 
listed herein by no more than about 20%, i.e. , the number of 
individual nucleotide substitutions, additions, and/or 
deletions in a substantially equivalent sequence, as compared 
to the corresponding reference sequence, divided by the total 
number of nucleotides in the substantially equivalent 

15 sequence is about 0.2 or less. Such a sequence is said to 
have 80% sequence identity to the listed sequence. Such a 
substantially equivalent sequence can be routinely identified 
by applying the foregoing algorithm. 

In one embodiment, a substantially equivalent 
polynucleotide sequence of the invention varies from a listed 

20 sequence by no more than 10%, i.e. . the number of individual 
nucleotide substitutions, additions, and/ or deletions in a 
substantially equivalent sequence, as compared to the 
corresponding reference sequence, divided by the total number 
of nucleotides in the substantially equivalent sequence is 
about 0.1 or less. Such a sequence is said to have 90% 

25 sequence identity to the listed sequence. Such a 

substantially equivalent sequence can be routinely identified 
by applying the foregoing algorithm. 

In an alternate embodiment a substantially equivalent 
sequence of the invention varies from a listed sequence by no 
more than by no more than 5%, i.e. f the number of individual 

30 nucleotide substitutions, additions, and/or deletions in a 
substantially equivalent sequence, as compared to the 
corresponding reference sequence, divided by the total number 
of nucleotides in the substantially equivalent sequence is 
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about 0.05 or less. Such a sequence is said to have 95% 
sequence identity to the listed sequence. Such a 
substantially equivalent sequence can be routinely identified 
by applying the foregoing algorithm. 

5 In yet another alternate embodiment, a substantially 

equivalent sequence of the invention varies from a listed 
sequences by no more than 2%, i.e. f the number of individual 
nucleotide substitutions, additions, and/or deletions in a 
substantially equivalent sequence, as compared to the 
corresponding reference sequence, divided by the total number 

10 of nucleotides in the substantially equivalent sequence is 
about 0.02 or less. Such a sequence is said to have 98% 
sequence identity to the listed sequence. Such a 
substantially equivalent sequence can be routinely 
identified. 

A polynucleotide according to the invention can be 

15 joined to any of a variety of other nucleotide sequences by 
well-established recombinant DNA techniques (see Sambrook J 
et al. (1989) Molecular Cloning: A Laboratory Manual, Cold 
Spring Harbor Laboratory, NY) . Useful nucleotide sequences 
for joining to polypeptides include an assortment of vectors, 
e.g., plasmids, cosmids, lambda phage derivatives, phagemids, 

20 and the like, that are well known in the art. Accordingly, 
the invention also provides a vector including a 
polynucleotide of the invention and a host cell containing 
the polynucleotide. In general, the vector contains an 
origin of replication functional in at least one organism, 
convenient restriction endonuclease sites, and a selectable 

25 marker for the host cell. Vectors according to the invention 
include expression vectors, replication vectors, probe 
generation vectors, and sequencing vectors. A host cell 
according to the invention can be a prokaryotic or eukaryotic 
cell and can be a unicellular organism or part of a 
multicellular organism. 

30 The sequences falling within the scope of the present 

invention are not limited to the specific sequences herein 
described, but also include allelic variations thereof. 
Allelic variations can be routinely determined by comparing 
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the sequence provided in SEQ ID NO:l, SEQ ID NO: 3, SEQ ID 
NO: 5 or SEQ ID NO: 7, a representative intermediate fragment 
thereof, or a nucleotide sequence at least 99.9% identical to 
SEQ ID NO:l, SEQ ID N0:3, SEQ ID N0:5 or SEQ ID NO:7 with a 
5 sequence from another isolate of the same species. 

Furthermore, to accommodate codon variability, the invention 
includes nucleic acid molecules coding for the same amino 
acid sequences as do the specific ORFs disclosed herein. In 
other words, in the coding region of an ORF, substitution of 
one codon for another which encodes the same amino acid is 
10 expressly contemplated. 

It is to be understood that nucleic acid molecules 
consisting of the following nucleotide sequences are not 
considered part of the present invention: the nucleotide 
sequence or, where appropriate, the nucleotide sequence that 
encodes the depicted amino acid sequence, of Genbank™ 
IS accession number S73813, gill754710, U91510, U91511, 

AA116990, AA120757, H08436, AA378537, AA336644, AA338117, 
AA337885, N72742, AA256016, AA611283, AA647051, AA638277, 
AA271520, W46136, AA391695, AA390461, AA201196, AA246996, 
AA567512 or AC002032. 

The present invention further provides recombinant 
20 constructs comprising a nucleic acid having the sequence of 
SEQ ID NO:l, SEQ ID NO:3, SEQ ID NO:5, or SEQ ID NO:7 or an 
intermediate fragment thereof, or another of the nucleic acid 
molecules of the invention. The recombinant constructs of 
the present invention comprise a vector, such as a plasmid or 
viral vector, into which a nucleic acid having the sequence 
25 of SEQ ID NO:l, SEQ ID NO: 3, SEQ ID NO: 5 or SEQ ID NO:7, or 
an intermediate fragment thereof, or another of the nucleic 
acid molecules of the invention, is inserted, in a forward or 
reverse orientation. In the case of a vector comprising one 
of the ORFs of the present invention, the vector may further 
comprise regulatory sequences, including for example, a 
30 promoter, operably linked to the ORF. For vectors comprising 
the EMFs and UMFs of the present invention, the vector may 
further comprise a marker sequence or heterologous ORF 
operably linked to the EMF or UMF. Large numbers of suitable 
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vectors and promoters are known to those of skill in the art 
and are commercially available for generating the recombinant 
constructs of the present invention. The following vectors 
are provided by way of example. Bacterial: pBs, 
5 phagescript, PsiX174, pBluescript SK, pBs KS, pNK8a, pNH16a, 
pNH18a, pNH46a (Stratagene) ; pTrc99A, pKK223-3, pKK233-3, 
pDR540, pRIT5 (Pharmacia) . Eukaryotic: pWLneo, pSV2cat, 
pOG44, PXTI, pSG (Stratagene) pSVK3, pBPV, pMSG, pSVL 
(Pharmacia) . 

The isolated polynucleotides of the invention may be 
10 operably linked to an expression control sequence such as the 
pMT2 or pED expression vectors disclosed in Kaufman et al., 
Nucleic Acids Res. 19, 4485-4490 (1991), in order to produce 
the protein recombinant ly. Many suitable expression control 
sequences are known in the art. General methods of expressing 
recombinant proteins are also known and are exemplified in R. 
15 Kaufman, Methods in Enzymology 185, 537-566 (1990) . As 
defined herein "operably linked" means that the isolated 
polynucleotide of the invention and an expression control 
sequence are situated within a vector or cell in such a way 
that the protein is expressed by a host cell which has been 
transformed (transf ected) with the ligated polynucleotide/ 
20 expression control sequence. 

Promoter regions can be selected from any desired gene 
using CAT (chloramphenicol transferase) vectors or other 
vectors with selectable markers. Two appropriate vectors are 
PKK232-8 and pCM7. Particular named bacterial promoters 
include lad, lacZ, T3, T7, gpt, lambda P R , and trc. 
25 Eukaryotic promoters include CMV immediate early, HSV 

thymidine kinase, early and late SV40, LTRs from retrovirus, 
and mouse metallothionein-I. Selection of the appropriate 
vector and promoter is well within the level of ordinary 
skill in the art. Generally, recombinant expression vectors 
will include origins of replication and selectable markers 
30 permitting transformation of the host cell, e.g., the 

ampicillin resistance gene of E. coll and S. cerevisiae TRP1 
gene, and a promoter derived from a highly-expressed gene to 
direct transcription of a downstream structural sequence. 



Such promoters can be derived from operons encoding 
glycolytic enzymes such as 3-phosphoglycerate kinase (PGK) , 
a-f actor, acid phosphatase, or heat shock proteins, among 
others. The heterologous structural sequence is assembled in 
5 appropriate phase with translation initiation and termination 
sequences, and preferably, a leader sequence capable of 
directing secretion of translated protein into the 
periplasmic space or extracellular medium. Optionally, the 
heterologous sequence can encode a fusion protein including 
an N-terminal identification peptide imparting desired 
10 characteristics, e.g., stabilization or simplified 
purification of expressed recombinant product. Useful 
expression vectors for bacterial use are constructed by 
inserting a structural DNA sequence encoding a desired 
protein together with suitable translation initiation and 
termination signals in operable reading phase with a 
15 functional promoter. The vector will comprise one or more 
phenotypic selectable markers and an origin of replication to 
ensure maintenance of the vector and to, if desirable, 
provide amplification within the host. Suitable prokaryotic 
hosts for transformation include E. coli, Bacillus subtilis, 
Salmonella typhimurium and various species within the genera 
20 Pseudomonas, Streptomyces , and Staphylococcus, although 
others may also be employed as a matter of choice. 

As a representative but non-limiting example, useful 
expression vectors for bacterial use can comprise a 
selectable marker and bacterial origin of replication derived 
from commercially available plasmids comprising genetic 
^ elements of the well known cloning vector pBR322 (ATCC 
37017) . Such commercial vectors include, for example, 
pKK223-3 (Pharmacia Fine Chemicals, Uppsala, Sweden) and GEM 
1 (Promega Biotec, Madison, WI, USA) . These pBR322 
"backbone" sections are combined with an appropriate promoter 
and the structural sequence to be expressed. Following 
30 transformation of a suitable host strain and growth of the 
host strain to an appropriate cell density, the selected 
promoter is induced or derepressed by appropriate means 
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(e.g., temperature shift or chemical induction) and cells are 
cultured for an additional period. Cells are typically 
harvested by centrifugation, disrupted by physical or 
chemical means, and the resulting crude extract retained for 
5 further purification. 

The nucleic acid sequences of the invention are further 
directed to sequences which encode variants of the described 
nucleic acids. These amino acid sequence variants may be 
prepared by methods known in the art by introducing 
appropriate nucleotide changes into a native or variant 

10 polynucleotide. There are two variables in the construction 
of amino acid sequence variants: the location of the 
mutation and the nature of the mutation. The amino acid 
sequence variants of the nucleic acids are preferably 
constructed by mutating the polynucleotide to give an amino 
acid sequence that does not occur in nature. These amino 

15 acid alterations can be made at sites that differ in the 

nucleic acids from different species (variable positions) or 
in highly conserved regions (constant regions) . Sites at 
such locations will typically be modified in series, e.g., by 
substituting first with conservative choices (e.g., 
hydrophobic amino acid to a different hydrophobic amino acid) 

^ and then with more distant choices (e.g., hydrophobic amino 
acid to a charged amino acid) , and then deletions or 
insertions may be made at the target site. Amino acid 
sequence deletions generally range from about 1 to 30 
residues, preferably about 1 to 10 residues, and are 

25 typically contiguous. Amino acid insertions include amino- 
and/or carboxyl-terminal fusions ranging in length from one 
to one hundred or more residues, as well as intrasequence 
insertions of single or multiple amino acid residues. 
Intrasequence insertions may range generally from about 1 to 
10 amino residues, preferably from 1 to 5 residues. Examples 

^ of terminal insertions include the heterologous signal 
sequences necessary for secretion or for intracellular 
targeting in different host cells. 
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In a preferred method, polynucleotides encoding the 
novel nucleic acids are changed via site-directed 
mutagenesis. This method uses oligonucleotide sequences that 
encode the polynucleotide sequence of the desired amino acid 

5 variant, as well as a sufficient adjacent nucleotide on both 
sides of the changed amino acid to form a stable duplex on 
either side of the site of being changed. In general, the 
techniques of site-directed mutagenesis are well known to 
those of skill in the art and this technique is exemplified 
by publications such as, Edelman et al., DNA 2.: 183 (1983). A 

10 versatile and efficient method for producing site-specific 
changes in a polynucleotide sequence was published by Zoller 
and Smith, Nucleic Acids Res. 10:6487-6500 (1982). PCR may 
also be used to create amino acid sequence variants of the 
novel nucleic acids. When small amounts of template DNA are 
used as starting material, primer (s) that differs slightly in 

15 sequence from the corresponding region in the template DNA 
can generate the desired amino acid variant. PCR 
amplification results in a population of product DNA 
fragments that differ from the polynucleotide template 
encoding the polypeptide at the position specified by the 
primer. The product DNA fragments replace the corresponding 
20 region in the plasmid and this gives the desired amino acid 
variant. 

A further technique for generating amino acid variants 
is the cassette mutagenesis technique described in Wells et 
al., Gene 34 :315 (1985); and other mutagenesis techniques 
well known in the art, such as, for example, the techniques 
in Sambrook et al . , supra . and Current Protocols in Molecular 
Biology . Ausubel et al . Due to the inherent degeneracy of 
the genetic code, other DNA sequences which encode 
substantially the same or a functionally equivalent amino 
acid sequence may be used in the practice of the invention 
30 for the cloning and expression of these novel nucleic acids. 
Such DNA sequences include those which are capable of 
hybridizing to the appropriate novel nucleic acid sequence 
under stringent conditions. 
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Finally, it is to be understood that the nucleic acid 
molecules of the invention further include any nucleic acid 
molecule that encodes the polypeptides of the invention, as 
described in Section 5.4, below. 

5 

5.3. Hosts 

The present invention further provides host cells 
genetically engineered to contain the polynucleotides of the 
invention. For example, such host cells may contain nucleic 
acids of the invention introduced into the host cell using 

10 known transformation, transfection or infection methods. The 
present invention still further provides host cells 
genetically engineered to express the polynucleotides of the 
invention, wherein such polynucleotides are in operative 
association with a regulatory sequence heterologous to the 
host cell which drives expression of the polynucleotides in 

15 the cell. 

The host cell can be a higher eukaryotic host cell, such 
as a mammalian cell, a lower eukaryotic host cell, such as a 
yeast cell, or the host cell can be a prokaryotic cell, such 
as a bacterial cell. Introduction of the recombinant 
construct into the host cell can be effected by calcium 

20 phosphate transfection, DEAE, dextran mediated transfection, 
or electroporation (Davis, L. et al., Basic Methods in 
Molecular Biology (1986)). The host cells containing one of 
polynucleotides of the invention, can be used in conventional 
manners to produce the gene product encoded by the isolated 

^ fragment (in the case of an ORF) or can be used to produce a 
heterologous protein under the control of the EMF. 

Any host/vector system can be used to express one or 
more of the ORFs of the present invention. These include, 
but are not limited to, eukaryotic hosts such as HeLa cells, 
Cv-1 cell, COS cells, and Sf9 cells, as well .as prokaryotic 
host such as E. coli and B . subtil is . The most preferred 

30 

cells are those which do not normally express the particular 
polypeptide or protein or which expresses the polypeptide or 
protein at low natural level. Mature proteins can be 
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expressed in mammalian cells, yeast, bacteria, or other cells 
under the control of appropriate promoters. Cell-free 
translation systems can also be employed to produce such 
proteins using RNAs derived from the DNA constructs of the 
5 present invention- Appropriate cloning and expression 
vectors for use with prokaryotic and eukaryotic hosts are 
described by Sambrook, et al . , in Molecular Cloning: A 
Laboratory Manual, Second Edition, Cold Spring Harbor, New 
York (1989), the disclosure of which is hereby incorporated 
by reference. 

*® Various mammalian cell culture systems can also be 

employed to express recombinant protein. Examples of 
mammalian expression systems include the COS-7 lines of 
monkey kidney fibroblasts, described by Gluzman, Cell 23:175 
(1981) , and other cell lines capable of expressing a 

15 compatible vector, for example, the C127, 3T3, CHO, HeLa and 
BHK cell tines. Mammalian expression vectors will comprise 
an origin of replication, a suitable promoter and also any 
necessary ribosome binding sites, polyadenylation site, 
splice donor and acceptor sites, transcriptional termination 
sequences, and 5' flanking nontranscribed sequences. DNA 

20 sequences derived from the SV40 viral genome, for example, 
SV40 origin, early promoter, enhancer, splice, and 
polyadenylation sites may be used to provide the required 
nontranscribed genetic elements. Recombinant polypeptides and 
proteins produced in bacterial culture are usually isolated 
by initial extraction from cell pellets, followed by one or 

25 more salting-out, aqueous ion exchange or size exclusion 
chromatography steps. Protein refolding steps can be used, 
as necessary, in completing configuration of the mature 
protein. Finally, high performance liquid chromatography 
(HPLC) can be employed for final purification steps. 
Microbial cells employed in expression of proteins can be 
disrupted by any convenient method, including freeze-thaw 
cycling, sonication, mechanical disruption, or use of cell 
lysing agents. 
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A number of types of cells may act as suitable host 
cells for expression of the protein. Mammalian host cells 
include, for example, monkey COS cells, Chinese Hamster Ovary 
(CHO) cells, human kidney 293 cells, human epidermal A431 
5 cells, human Colo205 cells, 3T3 cells, CV-1 cells, other 
transformed primate cell lines, normal diploid cells, cell 
strains derived from in vitro culture of primary tissue, 
primary explants, HeLa cells, mouse L cells, BHK, HL-60, 
U937, HaK or Jurkat cells. 

Alternatively, it may be possible to produce the protein 

10 in lower eukaryotes such as yeast or in prokaryotes such as 
bacteria. Potentially suitable yeast strains include 
Saccharomyces cerevisiae, Schizosaccharomyces pombe, 
Kluyveromyces strains, Candida, or any yeast strain capable 
of expressing heterologous proteins. Potentially suitable 
bacterial strains include Escherichia coli, Bacillus 

15 subtilis, Salmonella typhimurium, or any bacterial strain 
capable of expressing heterologous proteins. If the protein 
is made in yeast or bacteria, it may be necessary to modify 
the protein produced therein, for example by phosphorylation 
or glycosylation of the appropriate sites, in order to obtain 
the functional protein. Such covalent attachments may be 

20 accomplished using known chemical or enzymatic methods. 

In another embodiment of the present invention, cells 
and tissues may be engineered to express an endogenous gene 
comprising the polynucleotides of the invention under the 1 
control of inducible regulatory elements, in which case the 
regulatory sequences of the endogenous gene may be replaced 

25 by homologous recombination. As described herein, gene 

targeting can be used to replace a gene's existing regulatory 
region with a regulatory sequence isolated from a different 
gene or a novel regulatory sequence synthesized by genetic 
engineering methods. Such regulatory sequences may be 
comprised of promoters, enhancers, scaffold-attachment 

30 regions, negative regulatory elements, transcriptional 
initiation sites, regulatory protein binding sites or 
combinations of said sequences. Alternatively, sequences 
which affect the structure or stability of the RNA or protein 
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produced may be replaced, removed, added, or otherwise 
modified by targeting, including polyadenylation signals. 
mRNA stability elements, splice sites, leader sequences for 
enhancing or modifying transport or secretion properties of 
5 the protein, or other sequences which alter or improve the 
function or stability of protein or RNA molecules. 

The targeting event may be a simple insertion of the 
regulatory sequence, placing the gene under the control of 
the new regulatory sequence, e.g. . inserting a new promoter 
or enhancer or both upstream of a gene. Alternatively, the 
10 targeting event may be a simple deletion of a regulatory 

element, such as the deletion of a tissue-specific negative > 
regulatory element. Alternatively, the targeting event may 
replace an existing element; for example, a tissue-specific 
enhancer can be replaced by an enhancer that has broader or 
different cell-type specificity than the naturally occurring 
15 elements. Here, the naturally occurring sequences are 
deleted and new sequences are added. In all cases, the 
identification of the targeting event may be facilitated by 
the use of one or more selectable marker genes that are 
contiguous with the targeting DNA, allowing for the selection 
of cells in which the exogenous DNA has integrated into the 
20 host cell genome. The identification of the targeting event 
may also be facilitated by the use of one or more marker 
genes exhibiting the property of negative selection, such 
that the negatively selectable marker is linked to the 
exogenous DNA, but configured such that the negatively 
selectable marker flanks the targeting sequence, and such 
25 that a correct homologous recombination event with sequences 
in the host cell genome does not result in the stable 
integration of the negatively selectable marker. Markers 
useful for this purpose include the Herpes Simplex Virus 
thymidine kinase (TK) gene or the bacterial xanthine -guanine 
phosphor ibosyl -transferase (gpt) gene. 
30 The gene targeting or gene activation techniques which 

can be used in accordance with this aspect of the invention 
are more particularly described in U.S. Patent No. 5,272,071 
to Chappel; U.S. Patent No. 5,578,461 to sherwin et al.; 
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International Application No. PCT/US92/09627 (WO93/09222) by 
Selden et al.; and International Application No. 
PCT/US90/06436 (WO91/06667) by Skoultchi et al., each of 
which is incorporated by reference herein in its entirety. 

5 

5.4. Polypeptides of the Invention 

The isolated polypeptides of the invention include, but 
are not limited to, a polypeptide comprising the amino acid 
sequence of SEQ ID NO: 2; or a polypeptide comprising amino 
acid residues 72-93, 147-162, 191-211 OR 217-238 of SEQ ID 
10 NO: 2. 

The isolated polypeptides of the invention include, but 
are not limited to, a polypeptide comprising the amino acid 
sequence of SEQ ID NO: 4; or a polypeptide comprising amino 
acid residues 55-76, 132-150, 179-199 or 213-234 of SEQ ID 
NO:4. 

15 The isolated polypeptides of the invention include, but 

are not limited to, a polypeptide comprising the amino acid 
sequence of SEQ ID NO: 6; or a polypeptide comprising amino 
acid residues 47-68, 123-138, 167-187 or 193-214 of SEQ ID 
NO: 6. 

The isolated polypeptides of the invention include, but 
20 are not limited to, a polypeptide comprising the amino acid 
sequence of SEQ ID NO: 8; or a polypeptide comprising amino 
acid residues 46-67, 122-140, 166-187 or 194-213 of SEQ ID 
NO: 8. 

The isolated polypeptides of the invention include, but 
are not limited to, a polypeptide comprising the amino acid 
25 sequence of SEQ ID NO: 9; or a polypeptide comprising amino 
acid residues 77-98, 153-167, 197-217 or 223-242 of SEQ ID 
NO: 9. 

The isolated polypeptides of the invention further 
include polypeptides that are substantially equivalent to the 
polypeptides of SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID 
30 NO: 8 or SEQ ID NO: 9 or to specific domains thereof. 

Typically, such a substantially equivalent sequence varies 
from one of those listed herein by no more than about 20%, 
i.e. . the number of individual amino acid residue 
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substitutions, additions, and/or deletions in a substantially 
equivalent sequence, as compared to the corresponding 
reference sequence, divided by the total number of amino acid 
residues in the substantially equivalent sequence is about 
5 0.2 or less. Such a sequence is said to have 80% sequence 
identity to the listed sequence. Such a substantially 
equivalent sequence can be routinely identified by applying 
the foregoing algorithm. 

In one embodiment, a substantially equivalent 
polypeptide sequence of the invention varies from a listed 
10 sequence by no more than 10%, i.e. . the number of individual 
amino acid substitutions, additions, and/ or deletions in a 
substantially equivalent sequence, as compared to the 
corresponding reference sequence, divided by the total number 
of amino acid residues in the substantially equivalent 
sequence is about 0.1 or less. Such a sequence is said to 
15 have 90% sequence identity to the listed sequence. Such a 
substantially equivalent sequence can be routinely identified 
by applying the foregoing algorithm. 

In an alternate embodiment a substantially equivalent 
sequence of the invention varies from a listed sequence by no 
more than by no more than 5%, i.e. , the number of individual 
20 amino acid substitutions, additions, and/or deletions in a 
substantially equivalent sequence, as compared to the 
corresponding reference sequence, divided by the total number 
of amino acid residues in the substantially equivalent 
sequence is about 0.05 or less. Such a sequence is said to 
have 95% sequence identity to the listed sequence. Such a 
25 substantially equivalent sequence can be routinely identified 
by applying the foregoing algorithm. 

In yet another alternate embodiment, a substantially 
equivalent sequence of the invention varies from a listed 
sequences by no more than 2%, i.e. . the number of individual 
amino acid residue substitutions, additions, and/or deletions 
30 in a substantially equivalent sequence, as compared to the 
corresponding reference sequence, divided by the total number 
of amino acid residues in the substantially equivalent 
sequence is about 0.02 or less. Such a sequence is said to 
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have 98% sequence identity to the listed sequence. Such a 
substantially equivalent sequence can be routinely 
identified. 

Preferred embodiments include those in which the proteir 
5 produced by such process is a mature form of the protein. 
Protein compositions of the present invention may 
further comprise an acceptable carrier, such as a 
hydrophilic, e.g. , pharmaceutically acceptable, carrier. 

The invention also relates to methods for producing a 
polypeptide comprising growing a culture of the cells of the 
10 invention in a suitable culture medium, and purifying the 
protein from the culture. For example, the methods of the 
invention include a process for producing a polypeptide in 
which a host cell containing a suitable expression vector 
that includes a polynucleotide of the invention is cultured 
under conditions that allow expression of the encoded 
15 polypeptide. The polypeptide can be recovered from the 
culture, conveniently from the culture medium, and further 
purified. 

The present invention further provides isolated 
polypeptides encoded by the nucleic acid fragments of the 
present invention or by degenerate variants of the nucleic 
20 acid fragments of the present invention. By "degenerate 

variant" is intended nucleotide fragments which differ from a 
nucleic acid fragment of the present invention (e.g., an ORF) 
by nucleotide sequence but, due to the degeneracy of the 
genetic code, encode an identical polypeptide sequence. 
Preferred nucleic acid fragments of the present invention are 
25 the ORFs that encode proteins. 

A variety of methodologies known in the art can be 
utilized to obtain any one of the isolated polypeptides or 
proteins of the present invention. At the simplest level, 
the amino acid sequence can be synthesized using commercially 
available peptide synthesizers. This is particularly useful 
30 in producing small peptides and fragments of larger 
polypeptides. Fragments are useful, for example, in 
generating antibodies against the native polypeptide. In an 
alternative method, the polypeptide or protein is purified 



-38- 



NY2 - 924563.1 



from bacterial cells which naturally produce the polypeptide 
or protein. One skilled in the art can readily follow known 
methods for isolating polypeptides and proteins in order to 
obtain one of the isolated polypeptides or proteins of the 
5 present invention. These include, but are not limited to, 
immunochromatography, HPLC, size-exclusion chromatography, 
ion-exchange chromatography, and immuno-af f inity 
chromatography. See, e.g., Scopes, Protein Purification: 
Principles and Practice, Springer-Verlag (1994); Sambrook, et 
^ al . , in Molecular Cloning: A Laboratory Manual ; Ausubel et 
al., Current Protocols in Molecular Biology. 

The polypeptides and proteins of the present invention 
can alternatively be purified from cells which have been 
altered to express the desired polypeptide or protein. As 
used herein, a cell is said to be altered to express a 
15 desired polypeptide or protein when the cell, through genetic 
manipulation, is made to produce a polypeptide or protein 
which it normally does not produce or which the cell normally 
produces at a lower level. One skilled in the art can 
readily adapt procedures for introducing and expressing 
either recombinant or synthetic sequences into eukaryotic or 
20 prokaryotic cells in order to generate a cell which produces 
one of the polypeptides or proteins of the present invention. 
The purified polypeptides can be used in in vitro binding 
assays which are well known in the art to identify molecules 
which bind to the polypeptides. These molecules include but 
are not limited to, for e.g., small molecules, molecules from 
25 combinatorial libraries, antibodies or other proteins. The 
molecules identified in the binding assay are then tested for 
antagonist or agonist activity in in vivo tissue culture or 
animal models that are well known in the art. In brief, the 
molecules are titrated into a plurality of cell cultures or 
animals and then tested for either cell/animal death or 
30 prolonged survival of the animal/cells. 

In addition, the binding molecules may be complexed with 
toxins, e.g., ricin or cholera, or with other compounds that 
are toxic to cells. The toxin-binding molecule complex is 
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then targeted to the tumor or other cell by the specificity 
of the binding molecule for SEQ ID NO: 2, SEQ ID NO: 4 SEQ ID 
NO: 6, SEQ ID NO: 8 or SEQ ID NO: 9 , or another of the 
polypeptide of the invention. 
5 The protein of the invention may also be expressed as a 

product of transgenic animals, e.g., as a component of the 
milk of transgenic cows, goats, pigs, or sheep which are 
characterized by somatic or germ cells containing a 
nucleotide sequence encoding the protein. 

The protein may also be produced by known conventional 

10 chemical synthesis. Methods for constructing the proteins of 
the present invention by synthetic means are known to those 
skilled in the art. The synthetically-constructed protein 
sequences, by virtue of sharing primary, secondary or 
tertiary structural and/or conformational characteristics 
with proteins may possess biological properties in common 

15 therewith, including protein activity. Thus, they may be 

employed as biologically active or immunological substitutes 
for natural, purified proteins in screening of therapeutic 
compounds and in immunological processes for the development 
of ant ibodi es . 

The proteins provided herein also include proteins 

20 characterized by amino acid sequences similar to those of 
purified proteins but into which modification are naturally 
provided or deliberately engineered. For example, 
modifications in the peptide or DNA sequences can be made by 
those skilled in the art using known techniques. 
Modifications of interest in the protein sequences may 

25 include the alteration, substitution, replacement, insertion 
or deletion of a selected amino acid residue in the coding 
sequence. For example, one or more of the cysteine residues 
may be deleted or replaced with another amino acid to alter 
the conformation of the molecule. Techniques for such 
alteration, substitution, replacement, insertion or deletion 

30 are well known to those skilled in the art (see, e.g., U.S. 
Pat. No. 4,518,584). Preferably, such alteration, 
substitution, replacement, insertion or deletion retains the 
desired activity of the protein. 
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Other fragments and derivatives of the sequences of 
proteins which would be expected to retain protein activity 
in whole or in part and may thus be useful for screening or 
other immunological methodologies may also be easily made by 
5 those skilled in the art given the disclosures herein. Such 
modifications are believed to be encompassed by the present 
invention. 

The protein may also be produced by operably linking the 
isolated polynucleotide of the invention to suitable control 
sequences in one or more insect expression vectors, and 

10 employing an insect expression system. Materials and methods 
for baculovirus/ insect cell expression systems are 
commercially available in kit form from, e.g., Invitrogen, 
San Diego, Calif., U.S.A. (the MaxBat.RTM. kit), and such 
methods are well known in the art, as described in Summers 
and Smith, Texas Agricultural Experiment Station Bulletin No. 

15 1555 (1987) , incorporated herein by reference. As used 

herein, an insect cell capable of expressing a polynucleotide 
of the present invention is "transformed." 

The protein of the invention may be prepared by 
culturing transformed host cells under culture conditions 
suitable to express the recombinant protein. The resulting 

20 expressed protein may then be purified from such culture 
(i.e., from culture medium or cell extracts) using known 
purification processes, such as gel filtration and ion 
exchange chromatography. The purification of the protein may 
also include an affinity column containing agents which will 
bind to the protein; one or more column steps over such 

25 affinity resins as concanavalin A-agarose, heparin- 

toyopearl.RTM. or Cibacrom blue 3GA Sepharose.RTM. ; one or 
more steps involving hydrophobic interaction chromatography 
using such resins as phenyl ether, butyl ether, or propyl 
ether; or immunoaf f inity chromatography. 

Alternatively, the protein of the invention may also be 

30 expressed in a form which will facilitate purification. For 
example, it may be expressed as a fusion protein, such as 
those of maltose binding protein (MBP) , glutathione-S- 
transferase (GST) or thioredoxin (TRX) . Kits for expression 
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and purification of such fusion proteins are commercially 
available from New England BioLab (Beverly, Mass.)/ Pharmacia 
(Piscataway, N.J.) and In Vitrogen, respectively. The protein 
can also be tagged with an epitope and subsequently purified 

5 by using a specific antibody directed to such epitope. One 
such epitope ("Flag") is commercially available from Kodak 
(New Haven, Conn.). 

Finally, one or more reverse-phase high performance 
liquid chromatography (RP-HPLC) steps employing hydrophobic 
RP-HPLC media, e.g., silica gel having pendant methyl or 

10 other aliphatic groups, can be employed to further purify the 
protein. Some or all of the foregoing purification steps, in 
various combinations, can also be employed to provide a 
substantially homogeneous isolated recombinant protein. The 
protein thus purified is substantially free of other 
mammalian proteins and is defined in accordance with the 

15 present invention as an "isolated protein." 

It is to be understood that polypeptides consisting of 
the following amino acid sequence are not considered part of 
the present invention: the amino acid sequence of, or, where 
appropriate, reported to be encoded by the nucleotide 
sequence of Genbank™ accession No :S73813, gill754710, 

20 U91510, U91511, AA116990, AA120757, H08436, AA378537, 

AA336644, AA338117, AA337885, N72742, AA256016, AA611283, 
AA647051, AA638277, AA271520, W46136, AA391695, AA390461, 
AA201196, AA246996 or AA567512. 

5.5. Dses and Biological Activity 
25 The polynucleotides and proteins of the present 

invention are expected to exhibit one or more of the uses or 
biological activities (including those associated with assays 
cited herein) identified below. Uses or activities described 
for proteins of the present invention may be provided by 
administration or use of such proteins or by administration 
30 or use of polynucleotides encoding such proteins (such as, 
for example, in gene therapies or vectors suitable for 
introduction of DNA) . 



5.5.1. Research Uses and Utilities 
The polynucleotides provided by the present invention 
can be used by the research community for various purposes. 
The polynucleotides can be used to express recombinant 

5 protein for analysis, characterization or therapeutic use; as 
markers for tissues in which the corresponding protein is 
preferentially expressed (either constitutive ly or at a 
particular stage of tissue differentiation or development or 
in disease states) ; as molecular weight markers on Southern 
gels; as chromosome markers or tags (when labeled) to 

10 identify chromosomes or to map related gene positions; to 

compare with endogenous DNA sequences in patients to identify 
potential genetic disorders; as probes to hybridize and thus 
discover novel, related DNA sequences; as a source of 
information to derive PCR primers for genetic fingerprinting; 
as a probe to "subtract-out" known sequences in the process 

15 of discovering other novel polynucleotides; for selecting and 
making oligomers for attachment to a "gene chip" or other 
support, including for examination of expression patterns; to 
raise anti-protein antibodies using DNA immunization 
techniques; and as an antigen to raise anti-DNA antibodies or 
elicit another immune response. Where the polynucleotide 

20 encodes a protein which binds or potentially binds to another 
protein (such as, for example, in a receptor- ligand 
interaction) , the polynucleotide can also be used in 
interaction trap assays (such as, for example, that described 
in Gyuris et al., Cell 75:791-803 (1993)) to identify 
polynucleotides encoding the other protein with which binding 

25 occurs or to identify inhibitors of the binding interaction. 
The proteins provided by the present invention can 
similarly be used in assay to determine biological activity, 
including in a panel of multiple proteins for high-throughput 
screening; to raise antibodies or to elicit another immune 
response; as a reagent (including the labeled reagent) in 

30 assays designed to quantitatively determine levels of the 
protein (or its receptor) in biological fluids; as markers 
for tissues in which the corresponding protein is 
preferentially expressed (either constitutively or at a 
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particular stage of tissue differentiation or development or 
in a disease state) ; and, of course, to isolate correlative 
receptors or ligands. Where the protein binds or potentially 
binds to another protein (such as, for example, in a 

5 receptor-ligand interaction) , the protein can be used to 
identify the other protein with which binding occurs or to 
identify inhibitors of the binding interaction. Proteins 
involved in these binding interactions can also be used to 
screen for peptide or small molecule inhibitors or agonists 
of the binding interaction. 

10 Any or all of these research utilities are capable of 

being developed into reagent grade or kit format for 
commercialization as research products. 

Methods for performing the uses, listed above are well 
known to those skilled in the art. References disclosing such 
methods include without limitation "Molecular Cloning: A 

15 Laboratory Manual", 2d ed.. Cold Spring Harbor Laboratory 
Press, Sambrook, J., E. F. Fritsch and T. Maniatis eds., 
1989, and "Methods in Enzymology: Guide to Molecular Cloning 
Techniques", Academic Press, Berger, S. L. and A. R. Kimmel 
eds., 1987. 

20 5.5.2. Nutritional Uses 

Polynucleotides and proteins of the present invention 
can also be used as nutritional sources or supplements. Such 
uses include without limitation use as a protein or amino 
acid supplement, use as a carbon source, use as a nitrogen 
source and use as a source of carbohydrate. In such cases the 

25 protein or polynucleotide of the invention can be added to 
the feed of a particular organism or can be administered as a 
separate solid or liquid preparation, such as in the form of 
powder, pills, solutions, suspensions or capsules. In the 
case of microorganisms, the protein or polynucleotide of the 
invention can be added to the medium in or on which the 

30 microorganism is cultured. 
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5.5.3. Cytokine and Cell Proliferation/ 
Differentiation Activity 

A protein of the present invention may exhibit cytokine, 
cell proliferation (either inducing or inhibiting) or cell 

5 differentiation (either inducing or inhibiting) activity or 
may induce production of other cytokines in certain cell 
populations. A polynucleotide of the invention can encode a 
polypeptide exhibiting such attributes. Many protein factors 
discovered to date, including all known cytokines, have 
exhibited activity in one or more factor-dependent cell 

10 proliferation assays, and hence the assays serve as a 

convenient confirmation of cytokine activity. The activity of 
a protein of the present invention is evidenced by any one of 
a number of routine factor dependent cell proliferation 
assays for cell lines including, without limitation, 3 2D, 
DA2, DA1G, T10, B9 , B9/11, BaF3 , MC9/G, M+(preB M+) , 2E8, 

15 RB5, DAI, 123, T1165, HT2, CTLL2 , TF-1, Mo7e and CMK. 

The activity of a protein of the invention may, among 
other means, be measured by the following methods: 

Assays for T-cell or thymocyte proliferation include 
without limitation those described in: Current Protocols in 
Immunology, Ed by J. E. Coligan, A. M. Kruisbeek, D. H. 

20 Margulies, E. M. Shevach, W. Strober, Pub. Greene Publishing 
Associates and Wiley-Interscience (Chapter 3, In Vitro assays 
for Mouse Lymphocyte Function 3.1-3.19; Chapter 7, 
Immunologic studies in Humans); Takai et al., J. Immunol. 
137:3494-3500, 1986; Bertagnolli et al., J. Immunol. 
145:1706-1712, 1990; Bertagnolli et al., Cellular Immunology 

25 133:327-341, 1991; Bertagnolli, et al., I. Immunol. 149:3778- 
3783, 1992; Bowman et al., I. Immunol. 152:1756-1761, 1994. 

Assays for cytokine production and/or proliferation of 
spleen cells, lymph node cells or thymocytes include, without 
limitation, those described in: Polyclonal T cell 
stimulation, Kruisbeek, A. M. and Shevach, E. M. In Current 

30 Protocols in Immunology. J. E. e.a. Coligan eds. Vol 1 pp. 
3.12.1-3.12.14, John Wiley and Sons, Toronto. 1994; and 
Measurement of mouse and human interleukin .gamma., 
Schreiber, R. D. In Current Protocols in Immunology. J. E. 
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e.a. Coligan eds. Vol 1 pp. 6.8.1-6.8.8, John Wiley and Sons, 
Toronto. 1994. 

Assays for proliferation and differentiation of 
hematopoietic and lymphopoietic cells include, without 
5 limitation, those described in: Measurement of Human and 

Murine Inter leukin 2 and Inter leukin 4, Bottomly, K. , Davis, 
L. S. and Lipsky, P. E. In Current Protocols in Immunology. 
J. E. e.a. Coligan eds. Vol 1 pp. 6.3.1-6.3.12, John Wiley 
and Sons, Toronto. 1991; deVries et al., J. Exp. Med. 
173:1205-1211, 1991; Moreau et al., Nature 336:690-692, 1988; 

10 Greenberger et al., Proc. Natl. Acad. Sci. U.S.A. 80:2931- 
2938, 1983; Measurement of mouse and human interleukin 6 — 
Nordan, R. In Current Protocols in Immunology. J. E. e.a. 
Coligan eds. Vol 1 pp. 6.6.1-6.6.5, John Wiley and Sons, 
Toronto. 1991; Smith et al., Proc. Natl. Aced. Sci. U.S.A. 
83:1857-1861, 1986; Measurement of human Interleukin 11 — 

15 Bennett, F., Giannotti, J., Clark, S. C. and Turner, K. J. In 
Current Protocols in Immunology. J. E. e.a. Coligan eds. Vol 
1 pp. 6.15.1 John Wiley and Sons, Toronto. 1991; Measurement 
of mouse and human Interleukin 9 — Ciarletta, A., Giannotti, 
J., Clark, S. C. and Turner, K. J. In Current Protocols in 
Immunology. J. E. e.a. Coligan eds. Vol 1 pp. 6.13.1, John 

20 Wiley and Sons, Toronto. 1991. 

Assays for T-cell clone responses to antigens (which 
will identify, among others, proteins that affect APC-T cell 
interactions as well as direct T-cell effects by measuring 
proliferation and cytokine production) include, without 
limitation, those described in: Current Protocols in 

25 Immunology, Ed by J. E. Coligan, A. M. Kruisbeek, D. H. 

Margulies, E. M. Shevach, W Strober, Pub. Greene Publishing 
Associates and Wiley-Interscience (Chapter 3, In Vitro assays 
for Mouse Lymphocyte Function; Chapter 6, Cytokines and their 
cellular receptors; Chapter 7, Immunologic studies in 
Humans); Weinberger et al., Proc. Natl. Acad. Sci. USA 

30 77:6091-6095, 1980; Weinberger et al., Eur. J. Immun. 11:405- 
411, 1981; Takai et al., J. Immunol. 137:3494-3500, 1986; 
Takai et al., J. Immunol. 140:508-512, 1988. 
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5.5.4. Immune Stimulating or Suppressing Activity 



A protein of the present invention may also exhibit 
immune stimulating or immune suppressing activity, including 
without limitation the activities for which assays are 
5 described herein. A polynucleotide of the invention can 
encode a polypeptide exhibiting such activities. A protein 
may be useful in the treatment of various immune deficiencies 
and disorders (including severe combined immunodeficiency 
(SCID) ) , e.g., in regulating (up or down) growth and 
proliferation of T and/or B lymphocytes, as well as effecting 

10 the cytolytic activity of NK cells and other cell 

populations. These immune deficiencies may be genetic or be 
caused by vital (e.g. , HIV) as well as bacterial or fungal 
infections, or may result from autoimmune disorders. More 
specifically, infectious diseases causes by viral, bacterial, 
fungal or other infection may be treatable using a protein of 

IS the present invention, including infections by HIV, hepatitis 
viruses, herpesviruses, mycobacteria, Leishmania spp. , 
malaria spp. and various fungal infections such as 
candidiasis. Of course, in this regard, a protein of the 
present invention may also be useful where a boost to the 
immune system generally may be desirable, i.e., in the 

20 treatment of cancer. 

Autoimmune disorders which may be treated using a 
protein of the present invention include, for example, 
connective tissue disease, multiple sclerosis, systemic lupus 
erythematosus, rheumatoid arthritis, autoimmune pulmonary 
inflammation, Guillain-Barre syndrome, autoimmune 

25 thyroiditis, insulin dependent diabetes mellitis, myasthenia 
gravis, graf t-versus-host disease and autoimmune inflammatory 
eye disease. Such a protein of the present invention may also 
to be useful in the treatment of allergic reactions and 
conditions, such as asthma (particularly allergic asthma) or 
other respiratory problems. Other conditions, in which immune 

30 suppression is desired (including, for example, organ 

transplantation) , may also be treatable using a protein of 
the present invention. 
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Using the proteins of the invention it may also be 
possible to immune responses, in a number of ways. Down 
regulation may be in the form of inhibiting or blocking an 
immune response already in progress or may involve preventing 

5 the induction of an immune response. The functions of 
activated T cells may be inhibited by suppressing T cell 
responses or by inducing specific tolerance in T cells, or 
both. Immunosuppression of T cell responses is generally an 
active, non-antigen-specific, process which requires 
continuous exposure of the T cells to the suppressive agent. 

10 Tolerance, which involves inducing non-responsiveness or 

anergy in T cells, is distinguishable from immunosuppression 
in that it is generally antigen-specific and persists after 
exposure to the tolerizing agent has ceased. Operationally, 
tolerance can be demonstrated by the lack of a T cell 
response upon reexposure to specific antigen in the absence 

15 of the tolerizing agent. 

Down regulating or preventing one or more antigen 
functions (including without limitation B lymphocyte antigen 
functions (such as, for example, B7)), e.g., preventing high 
level lymphokine synthesis by activated T cells, will be 
useful in situations of tissue, skin and organ 

20 transplantation and in graf t-versus-host disease (GVHD) . For 
example, blockage of T cell function should result in reduced 
tissue destruction in tissue transplantation. Typically, in 
tissue transplants, rejection of the transplant is initiated 
through its recognition as foreign by T cells, followed by an 
immune reaction that destroys the transplant. The 

25 administration of a molecule which inhibits or blocks 
interaction of a B7 lymphocyte antigen with its natural 
ligand(s) on immune cells (such as a soluble, monomer ic form 
of a peptide having B7-2 activity alone or in conjunction 
with a monomeric form of a peptide having an activity of 
another B lymphocyte antigen (e.g., B7-1, B7-3) or blocking 

30 antibody) , prior to transplantation can lead to the binding 
of the molecule to the natural ligand(s) on the immune cells 
without transmitting the corresponding costimulatory signal. 
Blocking B lymphocyte antigen function in this matter 
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prevents cytokine synthesis by immune cells, such as T cells, 
and thus acts as an immunosuppressant. Moreover, the lack of 
costimulation may also be sufficient to anergize the T cells, 
thereby inducing tolerance in a subject. Induction of long- 
5 term tolerance by B lymphocyte antigen-blocking reagents may 
avoid the necessity of repeated administration of these 
blocking reagents. To achieve sufficient immunosuppression or 
tolerance in a subject, it may also be necessary to block the 
function of a combination of B lymphocyte antigens. 

The efficacy of particular blocking reagents in 

10 preventing organ transplant rejection or GVHD can be assessed 
using animal models that are predictive of efficacy in 
humans. Examples of appropriate systems which can be used 
include allogeneic cardiac grafts in rats and xenogeneic 
pancreatic islet cell grafts in mice, both of which have been 
used to examine the immunosuppressive effects of CTLA4Ig 

15 fusion proteins in vivo as described in Lenschow et al., 
Science 257:789-792 (1992) and Turka et al., Proc. Natl. 
Acad. Sci USA, 89:11102-11105 (1992). In addition, murine 
models of GVHD (see Paul ed. , Fundamental Immunology, Raven 
Press, New York, 1989, pp. 846-847) can be used to determine 
the effect of blocking B lymphocyte antigen function in vivo 

20 °n the development of that disease. 

Blocking antigen function may also be therapeutically 
useful for treating autoimmune diseases. Many autoimmune 
disorders are the result of inappropriate activation of T 
cells that are reactive against self tissue and which promote 
the production of cytokines and autoantibodies involved in 

25 the pathology of the diseases. Preventing the activation of 
autoreactive T cells may reduce or eliminate disease 
symptoms. Administration of reagents which block 
costimulation of T cells by disrupting receptor : ligand 
interactions of B lymphocyte antigens can be used to inhibit 
T cell activation and prevent production of autoantibodies or 

30 T cell-derived cytokines which may be involved in the disease 
process. Additionally, blocking reagents may induce antigen- 
specific tolerance of autoreactive T cells which could lead 
to long-term relief from the disease. The efficacy of 
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blocking reagents in preventing or alleviating autoimmune 
disorders can be determined using a number of well- 
characterized animal models of human autoimmune diseases. 
Examples include murine experimental autoimmune encephalitis, 

5 systemic lupus erythmatosis in MRL/lpr/lpr mice or NZB hybrid 
mice, murine autoimmune collagen arthritis, diabetes mellitus 
in NOD mice and BB rats, and murine experimental myasthenia 
gravis (see Paul ed. , Fundamental Immunology, Raven Press, 
New York, 1989, pp. 840-856). 

Upregulation of an antigen function (preferably a B 

10 lymphocyte antigen function) , as a means of up regulating 

immune responses, may also be useful in therapy. Upregulation 
of immune responses may be in the form of enhancing an 
existing immune response or eliciting an initial immune 
response. For example, enhancing an immune response through 
stimulating B lymphocyte antigen function may be useful in 

15 cases of viral infection. In addition, systemic viral 

diseases such as influenza, the common cold, and encephalitis 
might be alleviated by the administration of stimulatory 
forms of B lymphocyte antigens systemically. 

Alternatively, anti-vital immune responses may be 
enhanced in an infected patient by removing T cells from the 

20 patient, costimulating the T cells in vitro with viral 
antigen-pulsed APCs either expressing a peptide of the 
present invention or together with a stimulatory form of a 
soluble peptide of the present invention and reintroducing 
the in vitro activated T cells into the patient. Another 
method of enhancing anti-viral immune responses would be to 

25 isolate infected cells from a patient, transfect them with a 
nucleic acid encoding a protein of the present invention as 
described herein such that the cells express all or a portion 
of the protein on their surface, and reintroduce the 
transfected cells into the patient. The infected cells would 
now be capable of delivering a costimulatory signal to, and 

30 thereby activate, T cells in vivo. 

The presence of the peptide of the present invention 
having the activity of a B lymphocyte antigen (s) on the 
surface of the tumor cell provides the necessary 
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costimulation signal to T cells to induce a T cell mediated 
immune response against the transfected tumor cells. In 
addition, tumor cells which lack MHC class I or MHC class II 
molecules, or which fail to reexpress sufficient mounts of 

5 MHC class I or MHC class II molecules, can be transfected 
with nucleic acid encoding all or a portion of (e.g. , a 
cytoplasmic-domain truncated portion) of an MHC class I a 
•alpha, chain protein and .beta.. sub. 2 microglobulin protein 
or an MHC class II .alpha, chain protein and an MHC class II 
.beta, chain protein to thereby express MHC class I or MHC 

10 class II proteins on the cell surface. Expression of the 
appropriate class I or class II MHC in conjunction with a 
peptide having the activity of a B lymphocyte antigen (e.g., 
B7-1, B7-2, B7-3) induces a T cell mediated immune response 
against the transfected tumor cell. Optionally, a gene 
encoding an antisense construct which blocks expression of an 

15 MHC class II associated protein, such as the invariant chain, 
can also be cotransf ected with a DNA encoding a peptide 
having the activity of a B lymphocyte antigen to promote 
presentation of tumor associated antigens and induce tumor 
specific immunity. Thus, the induction of a T cell mediated 
immune response in a human subject may be sufficient to 

20 overcome tumor-specific tolerance in the subject. 

The activity of a protein of the invention may, among 
other means, be measured by the following methods: 

Suitable assays for thymocyte or splenocyte cytotoxicity 
include, without limitation, those described in: Current 
Protocols in Immunology, Ed by J. E. Coligan, A. M. 

25 Kruisbeek, D. H. Margulies, E. M. Shevach, W. Strober, Pub. 
Greene Publishing Associates and Wiley-Interscience (Chapter 
3, In Vitro assays for Mouse Lymphocyte Function 3.1-3.19; 
Chapter 7, Immunologic studies in Humans); Herrmann et al. , 
Proc. Natl. Acad. Sci. USA 78:24 88-2492, 1981; Herrmann et 
al., J. Immunol. 128:1968-1974, 1982; Handa et al., J. 

30 Immunol. 135:1564-1572, 1985; Takai et al., I. Immunol. 

137:3494-3500, 1986; Takai et al. , J. Immunol. 140:508-512, 
1988; Herrmann et al. , Proc. Natl. Acad. Sci. USA 78:2488- 
2492, 1981; Herrmann et al., J. Immunol. 128:1968-1974, 1982; 



Handa et al., J. Immunol. 135:1564-1572, 1985; Takai et al., 
J. Immunol. 137:3494-3500, 1986; Bowmanet al., J. Virology 
61:1992-1998; Takai et al., J. Immunol. 140:508-512, 1988; 
Bertagnolli et al., Cellular Immunology 133:327-341, 1991; 
5 Brown et al. , J. Immunol. 153:3079-3092, 1994. 

Assays for T-cell-dependent immunoglobulin responses and 
isotype switching (which will identify, among others, 
proteins that modulate T-cell dependent antibody responses 
and that affect Thl/Th2 profiles) include, without 
limitation, those described in: Maliszewski, J. Immunol. 

10 144:3028-3033, 1990; and Assays for B cell function: In vitro 
antibody production, Mond, J. J. and Brunswick, M. In Current 
Protocols in Immunology. J. E. e.a. Coligan eds. Vol 1 pp. 
3.8.1-3.8.16, John Wiley and Sons, Toronto. 1994. 

Mixed lymphocyte reaction (MLR) assays (which will 
identify, among others, proteins that generate predominantly 

15 Thl and CTL responses) include, without limitation, those 
described in: Current Protocols in Immunology, Ed by J. E. 
Coligan, A. M. Kruisbeek, D. H. Margulies, E. M. Shevach, W. 
Strober, Pub. Greene Publishing Associates and Wiley- 
Interscience (Chapter 3, In Vitro assays for Mouse Lymphocyte 
Function 3.1-3.19; Chapter 7, Immunologic studies in Humans); 

20 Takai et al. , J. Immunol. 137:3494-3500, 1986; Takai et al., 
J. Immunol. 140:508-512, 1988; Bertagnolli et al. , J. 
Immunol. 149:3778-3783, 1992. 

Dendritic cell-dependent assays (which will identify, 
among others, proteins expressed by dendritic cells that 
activate naive T-cells) include, without limitation, those 

25 described in: Guery et al., J. Immunol. 134:536-544, 1995; 
Inaba et al. , Journal of Experimental Medicine 173:549-559, 
1991; Macatonia et al., Journal of Immunology 154:5071-5079, 
1995; Porgador et al., Journal of Experimental Medicine 
182:255-260, 1995; Nair et al., Journal of Virology 67:4062- 
4069, 1993; Huang et al. , Science 264:961-965, 1994; 

30 Macatonia et al. , Journal of Experimental Medicine 169:1255- 
1264, 1989; Bhardwaj et al., Journal of Clinical 
Investigation 94:797-807, 1994; and Inaba et al., Journal of 
Experimental Medicine 172:631-640, 1990. 
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Assays for lymphocyte survival/apoptosis (which will 
identify, among others, proteins that prevent apoptosis after 
superantigen induction and proteins that regulate lymphocyte 
homeostasis) include, without limitation, those described in: 

5 Darzynkiewicz et al., Cytometry 13:795-808, 1992; Gorczyca et 
al., Leukemia 7:659-670, 1993; Gorczyca et al., Cancer 
Research 53:1945-1951, 1993; Itoh et al. , Cell 66:233-243, 
1991; Zacharchuk, Journal of Immunology 145:4037-4045, 1990; 
Zamai et al., Cytometry 14:891-897, 1993; Gorczyca et al., 
International Journal of Oncology 1:639-648, 1992. 

10 Assays for proteins that influence early steps of T-cell 

commitment and development include, without limitation, those 
described in: Antica et al., Blood 84:111-117, 1994; Fine et 
al., Cellular Immunology 155:111-122, 1994; Galy et al. , 
Blood 85:2770-2778, 1995; Toki et al., Proc. Nat. Acad Sci. 
USA 88:7548-7551, 1991. 

15 

5.5.5. Hematopoiesis Regulating Activity 

A protein of the present invention may be useful in 
regulation of hematopoiesis and, consequently, in the 
treatment of myeloid or lymphoid cell deficiencies. Even 
marginal biological activity in support of colony forming 

20 cells or of factor-dependent cell lines indicates involvement 
in regulating hematopoiesis, e.g. in supporting the growth 
and proliferation of erythroid progenitor cells alone or in 
combination with other cytokines, thereby indicating utility, 
for example, in treating various anemias or for use in 
conjunction with irradiation/chemotherapy to stimulate the 

25 production of erythroid precursors and/or erythroid cells; in 
supporting the growth and proliferation of myeloid cells such 
as granulocytes and monocytes/macrophages (i.e., traditional 
CSF activity) useful, for example, in conjunction with 
chemotherapy to prevent or treat consequent myelo- 
suppression; in supporting the growth and proliferation of 

30 megakaryocytes and consequently of platelets thereby allowing 
prevention or treatment of various platelet disorders such as 
thrombocytopenia, and generally for use in place of or 
complimentary to platelet transfusions; and/or in supporting 
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the growth and proliferation of hematopoietic stem cells 
which are capable of maturing to any and all of the above- 
mentioned hematopoietic cells and therefore find therapeutic 
utility in various stem cell disorders (such as those usually 

5 treated with transplantation, including, without limitation, 
aplastic anemia and paroxysmal nocturnal hemoglobinuria) , as 
well as in repopulating the stem cell compartment post 
irradiation/ chemotherapy, either in-vivo or ex-vivo (i.e., in 
conjunction with bone marrow transplantation or with 
peripheral progenitor cell transplantation (homologous or 

10 heterologous) ) as normal cells or genetically manipulated for 
gene therapy. 

The activity of a protein of the invention may, among 
other means, be measured by the following methods: 

Suitable assays for proliferation and differentiation of 
various hematopoietic lines are cited above. 

15 Assays for embryonic stem cell differentiation (which 

will identify, among others, proteins that influence 
embryonic differentiation hematopoiesis) include, without 
limitation, those described in: Johansson et al. Cellular 
Biology 15:141-151, 1995; Keller et al . , Molecular and 
Cellular Biology 13:473-486, 1993; McClanahan et al., Blood 

20 81:2903-2915, 1993. 

Assays for stem cell survival and differentiation (which 
will identify, among others, proteins that regulate lympho- 
hematopoiesis) include, without limitation, those described 
in: Methylcellulose colony forming assays, Freshney, M. G. In 
Culture of Hematopoietic Cells. R. I. Freshney, et al. eds. 

25 Vol pp. 265-268, Wiley-Liss, Inc., New York, N.Y. 1994; 
Hirayama et al., Proc. Natl. Acad. Sci. USA 89:5907-5911, 
1992; Primitive hematopoietic colony forming cells with high 
proliferative potential, McNiece, I. K. and Briddell, R. A. 
In Culture of Hematopoietic Cells. R. I. Freshney, et al. 
eds. Vol pp. 23-39, Wiley-Liss, Inc., New York, N.Y. 1994; 

30 Neben et al., Experimental Hematology 22:353-359, 1994; 
Cobblestone area forming cell assay, Ploemacher, R. E. In 
Culture of Hematopoietic Cells. R. I. Freshney, et al. eds. 
Vol pp. 1-21, Wiley-Liss, Inc., New York, N.Y. 1994; Long 
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term bone marrow cultures in the presence of stromal cells, 
Spooncer, E. , Dexter, M. and Allen, T. In Culture of 
Hematopoietic Cells. R. I. Freshney, et al. eds. Vol pp. 163- 
179, Wiley-Liss, Inc., New York, N.Y. 1994; Long term culture 
5 initiating cell assay, Sutherland, H. J. In Culture of 

Hematopoietic Cells. R. I. Freshney, et al. eds. Vol pp. 139- 
162, Wiley-Liss, Inc., New York, N.Y. 1994. 

5.5.6. Tissue Growth Activity 

A protein of the present invention also may have utility 

10 in compositions used for bone, cartilage, tendon, ligament 
and/or nerve tissue growth or regeneration, as well as for 
wound healing and tissue repair and replacement, and in the 
treatment of burns, incisions and ulcers. 

A protein of the present invention, which induces 
cartilage and/ or bone growth in circumstances where bone is 

15 not normally formed, has application in the healing of bone 
fractures and cartilage damage or defects in humans and other 
animals. Such a preparation employing a protein of the 
invention may have prophylactic use in closed as well as open 
fracture reduction and also in the improved fixation of 
artificial joints. De novo bone formation induced by an 

20 osteogenic agent contributes to the repair of congenital, 
trauma induced, or oncologic resection induced craniofacial 
defects, and also is useful in cosmetic plastic surgery. 

A protein of this invention may also be used in the 
treatment of periodontal disease, and in other tooth repair 
processes. Such agents may provide an environment to attract 

25 bone-forming cells, stimulate growth of bone-forming cells or 
induce differentiation of progenitors of bone-forming cells. 
A protein of the invention may also be useful in the 
treatment of osteoporosis or osteoarthritis, such as through 
stimulation of bone and/or cartilage repair or by blocking 
inflammation or processes of tissue destruction (collagenase 

30 activity, osteoclast activity, etc.) mediated by inflammatory 
processes. 

Another category of tissue regeneration activity that 
may be attributable to the protein of the present invention 
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is tendon/ ligament formation. A protein of the present 
invention, which induces tendon/ ligament-like tissue or other 
tissue formation in circumstances where such tissue is not 
normally formed, has application in the healing of tendon or 

5 ligament tears, deformities and other tendon or ligament 
defects in humans and other animals. Such a preparation 
employing a tendon/ ligament- like tissue inducing protein may 
have prophylactic use in preventing damage to tendon or 
ligament tissue, as well as use in the improved fixation of 
tendon or ligament to bone or other tissues, and in repairing 

10 defects to tendon or ligament tissue. De novo 

tendon/ligament-like tissue formation induced by a 
composition of the present invention contributes to the 
repair of congenital, trauma induced, or other tendon or 
ligament defects of other origin, and is also useful in 
cosmetic plastic surgery for attachment or repair of tendons 

15 or ligaments. The compositions of the present invention may 
provide environment to attract tendon- or ligament- forming 
cells, stimulate growth of tendon- or ligament-forming cells, 
induce differentiation of progenitors of tendon- or ligament- 
forming cells, or induce growth of tendon/ ligament cells or 
progenitors ex vivo for return in vivo to effect tissue 

20 repair. The compositions of the invention may also be useful 
in the treatment of tendinitis, carpal tunnel syndrome and 
other tendon or ligament defects. The compositions may also 
include an appropriate matrix and/ or sequestering agent as a 
carrier as is well known in the art. 

The protein of the present invention may also be useful 

25 for proliferation of neural cells and for regeneration of 

nerve and brain tissue, i.e. for the treatment of central and 
peripheral nervous system diseases and neuropathies, as well 
as mechanical and traumatic disorders, which involve 
degeneration, death or trauma to neural cells or nerve 
tissue. More specifically, a protein may be used in the 

30 treatment of diseases of the peripheral nervous system, such 
as peripheral nerve injuries, peripheral neuropathy and 
localized neuropathies, and central nervous system diseases, 
such as Alzheimer's, Parkinson's disease, Huntington's 
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disease, amyotrophic lateral sclerosis, and Shy-Drager 
syndrome. Further conditions which may be treated in 
accordance with the present invention include mechanical and 
traumatic disorders, such as spinal cord disorders, head 
5 trauma and cerebrovascular diseases such as stroke. 

Peripheral neuropathies resulting from chemotherapy or other 
medical therapies may also be treatable using a protein of 
the invention. 

Proteins of the invention may also be useful to promote 
better or faster closure of non-healing wounds, including 
10 without limitation pressure ulcers, ulcers associated with 
vascular insufficiency, surgical and traumatic wounds, and 
the like. 

It is expected that a protein of the present invention 
may also exhibit activity for generation or regeneration of 
other tissues, such as organs (including, for example, 
15 pancreas, liver, intestine, kidney, skin, endothelium) , 

muscle (smooth, skeletal or cardiac) and vascular (including 
vascular endothelium) tissue, or for promoting the growth of 
cells comprising such tissues. Part of the desired effects 
may be by inhibition or modulation of fibrotic scarring to 
allow normal tissue to regenerate. A protein of the invention 
20 may also exhibit angiogenic activity. 

A protein of the present invention may also be useful 
for gut protection or regeneration and treatment of lung or 
liver fibrosis, reperfusion injury in various tissues, and 
conditions resulting from systemic cytokine damage. 

A protein of the present invention may also be useful 
25 for promoting or inhibiting differentiation of tissues 
described above from precursor tissues or cells; or for 
inhibiting the growth of tissues described above. 

The activity of a protein of the invention may, among 
other means, be measured by the following methods: 

Assays for tissue generation activity include, without 
30 limitation, those described in: International Patent 
Publication No. WO95/16035 (bone, cartilage, tendon) ; 
International Patent Publication No. WO95/05846 (nerve, 
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neuronal) ; International Patent Publication No. WO91/07491 
(skin, endothelium) , 

Assays for wound healing activity include, without 
limitation, those described in: Winter, Epidermal Wound 
5 Healing, pps. 71-112 (Maibach, H. I. and Rovee, D. T . , eds.)/ 
Year Book Medical Publishers, Inc., Chicago, as modified by 
Eaglstein and Mertz, J. Invest. Dermatol 71:382-84 (1978). 

5.5.7. Act ivin/ Inhibin activity 

A protein of the present invention may also exhibit 

10 activin- or inhibin-related activities. A polynucleotide of 
the invention may encode a polypeptide exhibiting such 
characteristics. Inhibins are characterized by their ability 
to inhibit the release of follicle stimulating hormone (FSH) , 
while activins and are characterized by their ability to 
stimulate the release of follicle stimulating hormone (FSH) . 

15 Thus, a protein of the present invention, alone or in 

heterodimers with a member of the inhibin a-family, may be 
useful as a contraceptive based on the ability of inhibins tc 
decrease fertility in female mammals and decrease 
spermatogenesis in male mammals. Administration of sufficient 
amounts of other inhibins can induce infertility in these 

20 mammals. Alternatively, the protein of the invention, as a 
homodimer or as a heterodimer with other protein subunits of 
the inhibin-p group, may be useful as a fertility inducing 
therapeutic, based upon the ability of activin molecules in 
stimulating FSH release from cells of the anterior pituitary. 
See, for example, U.S. Pat. No. 4,798,885. A protein of the 

25 invention may also be useful for advancement of the onset of 
fertility in sexually immature mammals, so as to increase the 
lifetime reproductive performance of domestic animals such as 
cows, sheep and pigs. 

The activity of a protein of the invention may, among 
other means, be measured by the following methods: 

30 Assays for activin/ inhibin activity include, without 

limitation, those described in: Vale et al., Endocrinology 
91:562-572, 1972; Ling et al. , Nature 321:779-782, 1986; Vale 
et al., Nature 321:776-779, 1986; Mason et al. , Nature 
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5.5.8. Chemotactic/Chemokinetic Activity 

5 A protein of the present invention may have chemotactic 

or chemokinetic activity (e.g., act as a chemokine) for 
mammalian cells, including, for example, monocytes, 
fibroblasts, neutrophils, T-cells, mast cells, eosinophils, 
epithelial and/or endothelial cells. A polynucleotide of the 
invention can encode a polypeptide exhibiting such 

10 attributes. Chemotactic and chemokinetic proteins can be 
used to mobilize or attract a desired cell population to a 
desired site of action. Chemotactic or chemokinetic proteins 
provide particular advantages in treatment of wounds and 
other trauma to tissues, as well as in treatment of localized 
infections. For example, attraction of lymphocytes, monocytes 

IS or neutrophils to tumors or sites of infection may result in 
improved immune responses against the tumor or infecting 
agent. 

A protein or peptide has chemotactic activity for a 
particular cell population if it can stimulate, directly or 
indirectly, the directed orientation or movement of such cell 
20 population. Preferably, the protein or peptide has the 
ability to directly stimulate directed movement of cells. 
Whether a particular protein has chemotactic activity for a 
population of cells can be readily determined by employing 
such protein or peptide in any known assay for cell 
chemotaxis . 

25 The activity of a protein of the invention may, among 

other means, be measured by the following methods: 

Assays for chemotactic activity (which will identify 
proteins that induce or prevent chemotaxis) consist of assays 
that measure the ability of a protein to induce the migration 
of cells across a membrane as well as the ability of a 

30 protein to induce the adhesion of one cell population to 
another cell population. Suitable assays for movement and 
adhesion include, without limitation, those described in: 
Current Protocols in Immunology, Ed by J. E. Coligan, A. M. 
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Kruisbeek, D. H. Marguiles, E. M. Shevach, W. Strober, Pub. 
Greene Publishing Associates and Wiley-Interscience (Chapter 
6.12, Measurement of alpha and beta Chemokines 6 . 12 . 1- 
6.12.28; Taub et al. J. Clin. Invest. 95:1370-1376, 1995; 
5 Lind et al. APHIS 103:14 0-146, 1995; Muller et al Eur. J. 

Immunol. 25:1744-1748; Gruber et al. J. of Immunol. 152:5860- 
5867, 1994; Johnston et al. J. of Immunol. 153:1762-1768, 

1994. 

5.5.9. Hemostatic and Thrombolytic Activity 

10 A protein of the invention may also exhibit hemostatic 

or thrombolytic activity. A polynucleotide of the invention 
can encode a polypeptide exhibiting such attributes. Such a 
protein is expected to be useful in treatment of various 
coagulation disorders (including hereditary disorders, such 
as hemophilias) or to enhance coagulation and other 

15 hemostatic events in treating wounds resulting from trauma, 
surgery or other causes. A protein of the invention may also 
be useful for dissolving or inhibiting formation of 
thromboses and for treatment and prevention of conditions 
resulting therefrom (such as, for example, infarction of 
cardiac and central nervous system vessels (e.g., stroke). 

20 The activity of a protein of the invention may, among 

other means, be measured by the following methods: 

Assay for hemostatic and thrombolytic activity include, 
without limitation, those described in: Linet et al., J. 
Clin. Pharmacol. 26:131-140, 1986; Burdick et al., Thrombosis 
Res. 45:413-419, 1987; Humphrey et al., Fibrinolysis 5:71-79 

25 (1991); Schaub, Prostaglandins 35:467-474, 1988. 

5.5.10. Receptor /Liqand Activity 

A protein of the present invention may also demonstrate 
activity as receptors, receptor ligands or inhibitors or 
agonists of receptor/ ligand interactions. A polynucleotide 
^ of the invention can encode a polypeptide exhibiting such 
characteristics. Examples of such receptors and ligands 
include, without limitation, cytokine receptors and their 
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ligands, receptor kinases and their ligands, receptor 
phosphatases and their ligands, receptors involved in cell- 
cell interactions and their ligands (including without 
limitation, cellular adhesion molecules (such as selectins, 

5 integrins and their ligands) and receptor/ ligand pairs 
involved in antigen presentation, antigen recognition and 
development of cellular and humoral immune responses) . 
Receptors and ligands are also useful for screening of 
potential peptide or small molecule inhibitors of the 
relevant receptor /ligand interaction. A protein of the 

10 present invention (including, without limitation, fragments 
of receptors and ligands) may themselves be useful as 
inhibitors of receptor/ ligand interactions. 

The activity of a protein of the invention may, among 
other means, be measured by the following methods: 

Suitable assays for receptor-ligand activity include 

15 without limitation those described in: Current Protocols in 
Immunology, Ed by J. E. Coligan, A. M. Kruisbeek, D. H. 
Margulies, E. M. Shevach, W. Strober, Pub. Greene Publishing 
Associates and Wiley-Interscience (Chapter 7.28, Measurement 
of Cellular Adhesion under static conditions 7.28.1-7.28.22), 
Takai et al., Proc. Natl. Acad. Sci. USA 84:6864-6868, 1987; 

20 Bierer et al., J. Exp. Med. 168:1145-1156, 1988; Rosenstein 
et al., J. Exp. Med. 169:149-160 1989; Stoltenborg et al., J. 
Immunol. Methods 175:59-68, 1994; Stitt et al., Cell 80:661- 
670, 1995. 

5.5.11. Anti-Inflammatory Activity 

25 Proteins of the present invention may also exhibit anti- 

inflammatory activity. The anti-inflammatory activity may be 
achieved by providing a stimulus to cells involved in the 
inflammatory response, by inhibiting or promoting cell-cell 
interactions (such as, for example, cell adhesion) , by 
inhibiting or promoting chemotaxis of cells involved in the 

30 inflammatory process, inhibiting or promoting cell 

extravasation, or by stimulating or suppressing production of 
other factors which more directly inhibit or promote an 
inflammatory response. Proteins exhibiting such activities 



-61- 



can be used to treat inflammatory conditions including 
chronic or acute conditions) , including without limitation 
intimation associated with infection (such as septic shock, 
sepsis or systemic inflammatory response syndrome (SIRS)), 

5 ischemia-reperfusion injury, endotoxin lethality, arthritis, 
complement-mediated hyperacute rejection, nephritis, cytokine 
or chemokine- induced lung injury, inflammatory bowel disease, 
Crohn's disease or resulting from over production of 
cytokines such as TNF or IL-1. Proteins of the invention may 
also be useful to treat anaphylaxis and hypersensitivity to 

10 an antigenic substance or material. 

5.5.12. Leukemias 

Leukemias and related disorders may be treated or 
prevented by administration of a therapeutic that promotes or 
inhibits function of the polynucleotides and/or polypeptides 

15 of the invention. Such leukemias and related disorders 
include but are not limited to acute leukemia, acute 
lymphocytic leukemia, acute myelocytic leukemia, 
myeloblastic, promyelocytic, myelomonocytic, monocytic, 
erythro leukemia, chronic leukemia, chronic myelocytic 
(granulocytic) leukemia and chronic lymphocytic leukemia (for 

20 a review of such disorders, see Fishman et al., 1985, 
Medicine, 2d Ed., J.B. Lippincott Co., Philadelphia). 



5.5.13. Nervous System Disorders 

Nervous system disorders, involving cell types which can 
be tested for efficacy of intervention with compounds that 
modulate the activity of the polynucleotides and/ or 
polypeptides of the invention, and which can be treated upon 
thus observing an indication of therapeutic utility, include 
but are not limited to nervous system injuries, and diseases 
or disorders which result in either a disconnection of axons, 
a diminution or degeneration of neurons, or demyelination. 
Nervous system lesions which may be treated in a patient 
(including human and non-human mammalian patients) according 
to the invention include but are not limited to the following 
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lesions of either the central (including spinal cord, brain) 

or peripheral nervous systems: 

(i) traumatic lesions, including lesions caused by 
physical injury or associated with surgery, 
for example, lesions which sever a portion of 
the nervous system, or compression injuries; 
(ii) ischemic lesions, in which a lack of oxygen in 
a portion of the nervous system results in 
neuronal injury or death, including cerebral 
infarction or ischemia, or spinal cord 
infarction or ischemia; 
(iii) infectious lesions, in which a portion of the 
nervous system is destroyed or injured as a 
result of infection, for example, by an 
abscess or associated with infection by human 
immunodeficiency virus, herpes zoster, or 
herpes simplex virus or with Lyme disease, 
tuberculosis, syphilis; 
(iv) degenerative lesions, in which a portion of 

the nervous system is destroyed or injured as 
a result of a degenerative process including 
but not limited to degeneration associated 
with Parkinson's disease, Alzheimer's disease, 
Huntington's chorea, or amyotrophic lateral 
sclerosis; 

(v) lesions associated with nutritional diseases 
or disorders, in which a portion of the 
nervous system is destroyed or injured by a 
nutritional disorder or disorder of metabolism 
including but not limited to, vitamin B12 
deficiency, folic acid deficiency, Wernicke 
disease, tobacco-alcohol amblyopia, 
Marchiaf ava-Bignami disease (primary 
degeneration of the corpus callosum) , and 
alcoholic cerebellar degeneration; 
(vi) neurological lesions associated with systemic 
diseases including but not limited to diabetes 
(diabetic neuropathy, Bell's palsy), systemic 



lupus erythematosus, carcinoma, or 
sarcoidosis; 

(vii) lesions caused by toxic substances including 
alcohol, lead, or particular neurotoxins; and 
5 (viii) demyelinated lesions in which a portion of the 

nervous system is destroyed or injured by a 
demyelinating disease including but not 
limited to multiple sclerosis, human 
immunodeficiency virus-associated myelopathy, 
transverse myelopathy or various etiologies, 
10 progressive multifocal leukoencephalopathy , 

and central pontine myelinolysis. 
Therapeutics which are useful according to the 
invention for treatment of a nervous system disorder may be 
selected by testing for biological activity in promoting the 
survival or differentiation of neurons. For example, and not 
15 by way of limitation, therapeutics which elicit any of the 
following effects may be useful according to the invention: 

(i) increased survival time of neurons in culture; 
(ii) increased sprouting of neurons in culture or 
in vivo; 

(iii) increased production of a neuron-associated 
20 molecule in culture or in vivo, e.g., choline 

acetyltransferase or acetylcholinesterase with 
respect to motor neurons; or 
(iv) decreased symptoms of neuron dysfunction in 
vivo. 

2g Such effects may be measured by any method known in the art. 
In preferred, non-limiting embodiments, increased survival of 
neurons may be measured by the method set forth in Arakawa et 
al. (1990, J. Neurosci. 10:3507-3515); increased sprouting of 
neurons may be detected by methods set forth in Pestronk et 
al. (1980, Exp. Neurol. 70:65-82) or Brown et al. (1981, Ann. 

30 Rev. Neurosci. 4:17-42); increased production of neuron- 
associated molecules may be measured by bioassay, enzymatic 
assay, antibody binding, Northern blot assay, etc., depending 
on the molecule to be measured; and motor neuron dysfunction 
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may be measured by assessing the physical manifestation of 
motor neuron disorder, e.g., weakness, motor neuron 
conduction velocity, or functional disability. 

In a specific embodiments, motor neuron disorders 

5 that may be treated according to the invention include but 
are not limited to disorders such as infarction, infection, 
exposure to toxin, trauma, surgical damage, degenerative 
disease or malignancy that may affect motor neurons as well 
as other components of the nervous system, as well as 
disorders that selectively affect neurons such as amyotrophic 

10 lateral sclerosis, and including but not limited to 

progressive spinal muscular atrophy, progressive bulbar 
palsy, primary lateral sclerosis, infantile and juvenile 
muscular atrophy, progressive bulbar paralysis of childhood 
(Fazio-Londe syndrome) , poliomyelitis and the post polio 
syndrome, and Hereditary Motor sensory Neuropathy ( Charcot - 

15 Marie-Tooth Disease) . 

5.5.14. Other Activities 

A protein of the invention may also exhibit one or more 
of the following additional activities or effects: inhibiting 
the growth, infection or function of, or killing, infectious 

^ agents, including, without limitation, bacteria, viruses, 
fungi and other parasites; effecting (suppressing or 
enhancing) bodily characteristics, including, without 
limitation, height, weight, hair color, eye color, skin, fat 
to lean ratio or other tissue pigmentation, or organ or body 
part size or shape (such as, for example, breast augmentation 

^ or diminution, change in bone form or shape) ; effecting 
biorhythms or caricadic cycles or rhythms; effecting the 
fertility of male or female subjects; effecting the 
metabolism, catabolism, anabolism, processing, utilization, 
storage or elimination of dietary fat, lipid, protein, 
carbohydrate, vitamins, minerals, co-factors or other 

30 nutritional factors or component (s) ; effecting behavioral 
characteristics, including, without limitation, appetite, 
libido, stress, cognition (including cognitive disorders) , 
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depression (including depressive disorders) and violent 
behaviors; providing analgesic effects or other pain reducing 
effects; promoting differentiation and growth of embryonic 
stem cells in lineages other than hematopoietic lineages; 

5 hormonal or endocrine activity; in the case of enzymes, 
correcting deficiencies of the enzyme and treating 
deficiency-related diseases; treatment of hyperprolif erative 
disorders (such as, for example, psoriasis) ; immunoglobul in- 
like activity (such as, for example, the ability to bind 
antigens or complement); and the ability to act as an antigen 

10 in a vaccine composition to raise an immune response against 
such protein or another material or entity which is cross- 
reactive with such protein. 

5.6. Pharmaceutical Formulations and Routes of 
Administration 

15 A protein of the present invention (from whatever source 

derived, including without limitation from recombinant and 
non-recombinant sources) may be administered to a patient in 
need, by itself, or in pharmaceutical compositions where it 
is mixed with suitable carriers or excipient(s) at doses to 
treat or ameliorate a variety of disorders. Such a 
20 composition may also contain (in addition to protein and a 
carrier) diluents, fillers, salts, buffers, stabilizers, 
solubilizers, and other materials well known in the art. The 
term "pharmaceutically acceptable" means a non-toxic material 
that does not interfere with the effectiveness of the 
biological activity of the active ingredient { s) . The 
25 characteristics of the carrier will depend on the route of 
administration. The pharmaceutical composition of the 
invention may also contain cytokines, lymphokines, or other 
hematopoietic factors such as M-CSF, GM-CSF, TNF, IL-1, IL-2, 
IL-3, IL-4, IL-5, IL-6, IL-7, IL-8, IL-9, IL-10, IL-11, IL- 
12, IL-13, IL-14, IL-15, I FN, TNFO, TNF1, TNF2 , G-CSF, Meg- 
30 CSF, thrombopoietin, stem cell factor, and erythropoietin. 
The pharmaceutical composition may further contain other 
agents which either enhance the activity of the protein or 
compliment its activity or use in treatment. Such additional 
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factors and/or agents may be included in the pharmaceutical 
composition to produce a synergistic effect with protein of 
the invention, or to minimize side effects. Conversely, 
protein of the present invention may be included in 
5 formulations of the particular cytokine, lymphokine, other 
hematopoietic factor, thrombolytic or anti-thrombotic factor, 
or ant i- inflammatory agent to minimize side effects of the 
cytokine, lymphokine, other hematopoietic factor, 
thrombolytic or anti-thrombotic factor, or anti-inflammatory 
agent. A protein of the present invention may be active in 

10 multimers (e.g., heterodimers or homodimers) or complexes 
with itself or other proteins. As a result, pharmaceutical 
compositions of the invention may comprise a protein of the 
invention in such multimeric or complexed form. 

Techniques for formulation and administration of the 
compounds of the instant application may be found in 

15 "Remington's Pharmaceutical Sciences," Mack Publishing Co., 
Easton, PA, latest edition. A therapeutically effective dose 
further refers to that amount of the, compound sufficient to 
result in amelioration of symptoms, e.g., treatment, healing, 
prevention or amelioration of the relevant medical condition, 
or an increase in rate of treatment, healing, prevention or 

20 amelioration of such conditions. When applied to an 
individual active ingredient, administered alone, a 
therapeutically effective dose refers to that ingredient 
alone. When applied to a combination, a therapeutically 
effective dose refers to combined amounts of the active 
ingredients that result in the therapeutic effect, whether 

25 administered in combination, serially or simultaneously. 

In practicing the method of treatment or use of the 
present invention, a therapeutically effective amount of 
protein of the present invention is administered to a mammal 
having a condition to be treated. Protein of the present 
invention may be administered in accordance with the method 

30 of the invention either alone or in combination with other 
therapies such as treatments employing cytokines, lymphokines 
or other hematopoietic factors. When co-administered with one 
or more cytokines, lymphokines or other hematopoietic 
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factors, protein of the present invention may be administered 
either simultaneously with the cytokine (s) , lymphokine (s) , 
other hematopoietic factor (s), thrombolytic or anti- 
thrombotic factors, or sequentially. If administered 
5 sequentially, the attending physician will decide on the 

appropriate sequence of administering protein of the present 
invention in combination with cytokine (s), lymphokine ( s ) , 
other hematopoietic factor (s), thrombolytic or anti- 
thrombotic factors. 

10 5.6.1. Routes of Administration 

Suitable routes of administration may, for example, 
include oral, rectal, transmucosal, or intestinal 
administration; parenteral delivery, including intramuscular, 
subcutaneous, intramedullary injections, as well as 
intrathecal, direct intraventricular, intravenous, 

15 intraperitoneal, intranasal, or intraocular 

injections. Administration of protein of the present invention 
used in the pharmaceutical composition or to practice the 
method of the present invention can be carried out in a 
variety of conventional ways, such as oral ingestion, 
inhalation, topical application or cutaneous, subcutaneous, 

20 intraperitoneal, parenteral or intravenous injection. 
Intravenous administration to the patient is preferred. 

Alternately, one may administer the compound in a local 
rather than systemic manner, for example, via injection of 
the compound directly into a arthritic joints or in fibrotic 
tissue, often in a depot or sustained release formulation. 

25 in order to prevent the scarring process frequently occurring 
as complication of glaucoma surgery, the compounds may be 
administered topically, for example, as eye 

drops . Furthermore , one may administer the drug in a targeted 
drug delivery system, for example, in a liposome coated with 
a specific antibody, targeting, for example, arthritic or 
30 fibrotic tissue. The liposomes will be targeted to and taken 
up selectively by the afflicted tissue. 
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5.6.2. Compositions /Formulations 

Pharmaceutical compositions for use in accordance with 
the present invention thus may be formulated in a 
conventional manner using one or more physiologically 
5 acceptable carriers comprising excipients and auxiliaries 
which facilitate processing of the active compounds into 
preparations which can be used pharmaceutical ly. These 
pharmaceutical compositions may be manufactured in a manner 
that is itself known, e.g., by means of conventional mixing, 
dissolving, granulating, dragee-making, levigating, 

^ emulsifying, encapsulating, entrapping or lyophilizing 

processes . Proper formulation is dependent upon the route of 
administration chosen. When a therapeutically effective 
amount of protein of the present invention is administered 
orally, protein of the present invention will be in the form 
of a tablet, capsule, powder, solution or elixir. When 

* 5 administered in tablet form, the pharmaceutical composition 
of the invention may additionally contain a solid carrier 
such as a gelatin or an adjuvant. The tablet, capsule, and 
powder contain from about 5 to 95% protein of the present 
invention, and preferably from about 25 to 90% protein of the 
present invention. When administered in liquid form, a liquid 

^ carrier such as water, petroleum, oils of animal or plant 
origin such as peanut oil, mineral oil, soybean oil, or 
sesame oil, or synthetic oils may be added. The liquid form 
of the pharmaceutical composition may further contain 
physiological saline solution, dextrose or other saccharide 
solution, or glycols such as ethylene glycol, propylene 

^ glycol or polyethylene glycol. When administered in liquid 
form, the pharmaceutical composition contains from about 0.5 
to 90% by weight of protein of the present invention, and 
preferably from about 1 to 50% protein of the present 
invention. 

When a therapeutically effective amount of protein of 
^ the present invention is administered by intravenous, 

cutaneous or subcutaneous injection, protein of the present 
invention will be in the form of a pyrogen-f ree, parenteral ly 
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acceptable aqueous solution. The preparation of such 
parenterally acceptable protein solutions, having due regard 
to pH, isotonicity, stability, and the like, is within the 
skill in the art. A preferred pharmaceutical composition for 
5 intravenous, cutaneous, or subcutaneous injection should 
contain, in addition to protein of the present invention, an 
isotonic vehicle such as Sodium Chloride Injection, Ringer's 
Injection, Dextrose Injection, Dextrose and Sodium Chloride 
Injection, Lactated Ringer's Injection, or other vehicle as 
known in the art. The pharmaceutical composition of the 
10 present invention may also contain stabilizers, 

preservatives, buffers, antioxidants, or other additives 
known to those of skill in the art. For injection, the agents 
of the invention may be formulated in aqueous solutions, 
preferably in physiologically compatible buffers such as 
Hanks 's solution, Ringer's solution, or physiological saline 
15 buffer. For transmucosal administration, penetrants 

appropriate to the barrier to be permeated are used in the 
formulation. Such penetrants are generally known in the art. 

For oral administration, the compounds can be formulated 
readily by combining the active compounds with 
pharmaceutically acceptable carriers well known in the art. 
20 Such carriers enable the compounds of the invention to be 
formulated as tablets, pills, dragees, capsules, liquids, 
gels, syrups, slurries, suspensions and the like, for oral 
ingestion by a patient to be treated. Pharmaceutical 
preparations for oral use can be obtained solid excipient, 
optionally grinding a resulting mixture, and processing the 
25 mixture of granules, after adding suitable auxiliaries, if 
desired, to obtain tablets or dragee cores. Suitable 
excipients are, in particular, fillers such as sugars, 
including lactose, sucrose, mannitol, or sorbitol; cellulose 
preparations such as, for example, maize starch, wheat 
starch, rice starch, potato starch, gelatin, gum tragacanth, 
30 methyl cellulose, hydroxypropylmethyl-cellulose, sodium 
carboxymethylcellulose, and/or polyvinylpyrrolidone (PVP) . 
If desired, disintegrating agents may be added, such as the 
cross-linked polyvinyl pyrrolidone, agar, or alginic acid or 
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a salt thereof such as sodium alginate. Dragee cores are 
provided with suitable coatings. For this purpose, 
concentrated sugar solutions may be used, which may 
optionally contain gum arabic, talc, polyvinyl pyrrolidone, 
5 carbopol gel, polyethylene glycol, and/or titanium dioxide, 
lacquer solutions, and suitable organic solvents or solvent 
mixtures. Dyestuffs or pigments may be added to the tablets 
or dragee coatings for identification or to characterize 
different combinations of active compound doses. 

Pharmaceutical preparations which can be used orally 
10 include push-fit capsules made of gelatin, as well as soft, 
sealed capsules made of gelatin and a plasticizer, such as 
glycerol or sorbitol. The push-fit capsules can contain the 
active ingredients in admixture with filler such as lactose, 
binders such as starches, and/or lubricants such as talc or 
magnesium stearate and, optionally, stabilizers. In soft 
15 capsules, the active compounds may be dissolved or suspended 
in suitable liquids, such as fatty oils, liquid paraffin, or 
liquid polyethylene glycols. In addition, stabilizers may be 
added. All formulations for oral administration should be in 
dosages suitable for such administration. For buccal 
administration, the compositions may take the form of tablets 
20 or lozenges formulated in conventional manner. 

For administration by inhalation, the compounds for use 
according to the present invention are conveniently delivered 
in the form of an aerosol spray presentation from pressurized 
packs or a nebuliser, with the use of a suitable propellant, 
e.g., dichlorodif luoromethane, trichlorof luoromethane, 
25 dichlorotetraf luoroethane, carbon dioxide or other suitable 
gas. In the case of a pressurized aerosol the dosage unit 
may be determined by providing a valve to deliver a metered 
amount. Capsules and cartridges of, e.g., gelatin for use in 
an inhaler or insufflator may be formulated containing a 
powder mix of the compound and a suitable powder base such as 
30 lactose or starch. The compounds may be formulated for 
parenteral administration by injection, e.g., by bolus 
injection or continuous infusion. Formulations for injection 
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may be presented in unit dosage form, e.g., in ampoules or in 
multi-dose containers, with an added preservative. The 
compositions may take such forms as suspensions, solutions or 
emulsions in oily or aqueous vehicles, and may contain 
5 formulatory agents such as suspending, stabilizing and/or 
dispersing agents. 

Pharmaceutical formulations for parenteral 
administration include aqueous solutions of the active 
compounds in water-soluble form. Additionally, suspensions 
of the active compounds may be prepared as appropriate oily 
10 injection suspensions. Suitable lipophilic solvents or 

vehicles include fatty oils such as sesame oil, or synthetic 
fatty acid esters, such as ethyl oleate or triglycerides, or 
liposomes. Aqueous injection suspensions may contain 
substances which increase the viscosity of the suspension, 
such as sodium carboxymethyl cellulose, sorbitol, or dextran. 

15 Optionally, the suspension may also contain suitable 

stabilizers or agents which increase the solubility of the 
compounds to allow for the preparation of highly concentrated 
solutions. Alternatively, the active ingredient may be in 
powder form for constitution with a suitable vehicle, e.g., 
sterile pyrogen-free water, before use. 

20 The compounds may also be formulated in rectal 

compositions such as suppositories or retention enemas, e.g., 
containing conventional suppository bases such as cocoa 
butter or other glycerides . In addition to the formulations 
described previously, the compounds may also be formulated as 

^ a depot preparation. Such long acting formulations may be 
administered by implantation (for example subcutaneous ly or 
intramuscularly) or by intramuscular injection. Thus, for 
example, the compounds may be formulated with suitable 
polymeric or hydrophobic materials (for example as an 
emulsion in an acceptable oil) or ion exchange resins, or as 

^ sparingly soluble derivatives, for example, as a sparingly 
soluble salt. 

A pharmaceutical carrier for the hydrophobic compounds 
of the invention is a cosolvent system comprising benzyl 
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alcohol, a nonpolar surfactant, a water -miscible organic 
polymer, and an aqueous phase. The cosolvent system may be 
the VPD co-solvent system. VPD is a solution of 3% w/v benzyl 
alcohol, 8% w/v of the nonpolar surfactant polysorbate 80, 
5 and 65% w/v polyethylene glycol 3 00, made up to volume in 
absolute ethanol. The VPD co-solvent system (VPD:5W) 
consists of VPD diluted 1:1 with a 5% dextrose in water 
solution. This co-solvent system dissolves hydrophobic 
compounds well, and itself produces low toxicity upon 
systemic administration. Naturally, the proportions of a co- 

10 solvent system may be varied considerably without destroying 
its solubility and toxicity characteristics. Furthermore, 
the identity of the co-solvent components may be varied: for 
example, other low-toxicity nonpolar surfactants may be used 
instead of polysorbate 80; the fraction size of polyethylene 
glycol may be varied; other biocompatible polymers may 

15 replace polyethylene glycol, e.g. polyvinyl pyrrolidone; and 
other sugars or polysaccharides may substitute for 
dextrose. Alternatively , other delivery systems for 
hydrophobic pharmaceutical compounds may be employed. 
Liposomes and emulsions are well known examples of delivery 
vehicles or carriers for hydrophobic drugs. Certain organic 

20 solvents such as dimethylsulf oxide also may be employed, 
although usually at the cost of greater toxicity. 
Additionally, the compounds may be delivered using a 
sustained-release system, such as semipermeable matrices of 
solid hydrophobic polymers containing the therapeutic agent. 
Various of sustained-release materials have been established 

25 and are well known by those skilled in the art. Sustained- 
release capsules may, depending on their chemical nature, 
release the compounds for a few weeks up to over 100 days. 
Depending on the chemical nature and the biological stability 
of the therapeutic reagent, additional strategies for protein 
stabilization may be employed. 

30 The pharmaceutical compositions also may comprise 

suitable solid or gel phase carriers or excipients. Examples 
of such carriers or excipients include but are not limited to 
calcium carbonate, calcium phosphate, various sugars, 



-73- 



starches, cellulose derivatives, gelatin, and polymers such 
as polyethylene glycols. Many of the proteinase inhibiting 
compounds of the invention may be provided as salts with 
pharmaceutical ly compatible counterions . Such 
5 pharmaceutically acceptable base addition salts are those 
salts which retain the biological effectiveness and 
properties of the free acids and which are obtained by 
reaction with inorganic or organic bases such as sodium 
hydroxide, magnesium hydroxide, ammonia, trialkylamine, 
dialkylamine, monoalkylamine, dibasic amino acids, sodium 
10 acetate, potassium benzoate, triethanol amine and the like. 

The pharmaceutical composition of the invention may be 
in the form of a complex of the protein (s) of present 
invention along with protein or peptide antigens. The protein 
and/or peptide antigen will deliver a stimulatory signal to 
both B and T lymphocytes. B lymphocytes will respond to 
15 antigen through their surface immunoglobulin receptor. T 
lymphocytes will respond to antigen through the T cell 
receptor (TCR) following presentation of the antigen by MHC 
proteins. MHC and structurally related proteins including 
those encoded by class I and class II MHC genes on host cells 
will serve to present the peptide antigen (s) to T 
20 lymphocytes. The antigen components could also be supplied as 
purified MHC-peptide complexes alone or with co-stimulatory 
molecules that can directly signal T cells. Alternatively 
antibodies able to bind surface immunoglobulin and other 
molecules on B cells as well as antibodies able to bind the 
TCR and other molecules on T cells can be combined with the 
25 pharmaceutical composition of the invention. The 

pharmaceutical composition of the invention may be in the 
form of a liposome in which protein of the present invention 
is combined, in addition to other pharmaceutically acceptable 
carriers, with amphipathic agents such as lipids which exist 
in aggregated form as micelles, insoluble monolayers, liquid 
30 crystals, or lamellar layers in aqueous solution. Suitable 
lipids for liposomal formulation include, without limitation, 
monoglycerides, diglycerides, sulfatides, lysolecithin, 
phospholipids, saponin, bile acids, and the like. Preparation 
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of such liposomal formulations is within the level of skill 
in the art, as disclosed, for example, in U.S. Pat. Nos. 
4,235,871; 4,501,728; 4,837,028; and 4,737,323, all of which 
are incorporated herein by reference. 
5 The amount of protein of the present invention in the 

pharmaceutical composition of the present invention will 
depend upon the nature and severity of the condition being 
treated, and on the nature of prior treatments which the 
patient has undergone. Ultimately, the attending physician 
will decide the amount of protein of the present invention 
10 with which to treat each individual patient. Initially, the 
attending physician will administer low doses of protein of 
the present invention and observe the patient's response. 
Larger doses of protein of the present invention may be 
administered until the optimal therapeutic effect is obtained 
for the patient, and at that point the dosage is not 
IS increased further. It is contemplated that the various 

pharmaceutical compositions used to practice the method of 
the present invention should contain about 0.01 jig to about 
100 mg (preferably about 0.1 ptg to about 10 mg, more 
preferably about 0.1 ng to about 1 mg) of protein of the 
present invention per kg body weight. For compositions of the 
20 present invention which are useful for bone, cartilage, 
tendon or ligament regeneration, the therapeutic method 
includes administering the composition topically, 
systematically, or locally as an implant or device. When 
administered, the therapeutic composition for use in this 
invention is, of course, in a pyrogen-free, physiologically 
25 acceptable form. Further, the composition may desirably be 
encapsulated or injected in a viscous form for delivery to 
the site of bone, cartilage or tissue damage. Topical 
administration may be suitable for wound healing and tissue 
repair. Therapeutically useful agents other than a protein of 
the invention which may also optionally be included in the 
30 composition as described above, may alternatively or 

additionally, be administered simultaneously or sequentially 
with the composition in the methods of the invention. 
Preferably for bone and/or cartilage formation, the 
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composition would include a matrix capable of delivering the 
protein-containing composition to the site of bone and/or 
cartilage damage, providing a structure for the developing 
bone and cartilage and optimally capable of being resorbed 
5 into the body. Such matrices may be formed of materials 
presently in use for other implanted medical applications. 

The choice of matrix material is based on 
biocompatibility, biodegradability, mechanical properties, 
cosmetic appearance and interface properties. The particular 
application of the compositions will define the appropriate 
10 formulation. Potential matrices for the compositions may be 
biodegradable and chemically defined calcium sulfate, 
tricalciumphosphate, hydroxyapatite, polylactic acid, 
polyglycolic acid and polyanhydrides. Other potential 
materials are biodegradable and biologically well-defined, 
such as bone or dermal collagen. Further matrices are 
15 comprised of pure proteins or extracellular matrix 

components. Other potential matrices are nonbiodegradable and 
chemically defined, such as sintered hydroxyapatite, 
bioglass, aluminates, or other ceramics. Matrices may be 
comprised of combinations of any of the above mentioned types 
of material, such as polylactic acid and hydroxyapatite or 
20 collagen and tricalciumphosphate. The bioceramics may be 
altered in composition, such as in calcium-aluminate- 
phosphate and processing to alter pore size, particle size, 
particle shape, and biodegradability. Presently preferred is 
a 50:50 (mole weight) copolymer of lactic acid and glycolic 
acid in the form of porous particles having diameters ranging 
25 from 150 to 800 microns. In some applications, it will be 

useful to utilize a sequestering agent, such as carboxymethyl 
cellulose or autologous blood clot, to prevent the protein 
compositions from disassociating from the matrix. 

A preferred family of sequestering agents is cellulosic 
materials such as alxylcelluloses (including 
30 hydroxyalkylcellu loses) , including methylcellulose, 
ethylcellulose, hy dr oxy ethy lcellulose, 

hydroxypropylcellulose, hydroxypropyl-methylcellulose, and 
carboxymethylcellulose, the most preferred being cationic 
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salts of carboxymethylcellulose (CMC) . Other preferred 
sequestering agents include hyaluronic acid, sodium alginate, 
poly (ethylene glycol), polyoxyethylene oxide, carboxyvinyl 
polymer and poly (vinyl alcohol). The amount of sequestering 

5 agent useful herein is 0.5-20 wt %, preferably 1-10 wt % 
based on total formulation weight, which represents the 
amount necessary to prevent desorbtion of the protein from 
the polymer matrix and to provide appropriate handling of the 
composition, yet not so much that the progenitor cells are 
prevented from infiltrating the matrix, thereby providing the 

10 protein the opportunity to assist the osteogenic activity of 
the progenitor cells. In further compositions, proteins of 
the invention may be combined with other agents beneficial to 
the treatment of the bone and/or cartilage defect, wound, or 
tissue in question. These agents include various growth 
factors such as epidermal growth factor (EGF) , platelet 
15 derived growth factor (PDGF) , transforming growth factors 

(TGF-. alpha, and TGF-.beta.), and insulin-like growth factor 
(IGF) . 

The therapeutic compositions are also presently valuable 
for veterinary applications. Particularly domestic animals 
and thoroughbred horses, in addition to humans, are desired 

20 patients for such treatment with proteins of the present 
invention. The dosage regimen of a protein-containing 
pharmaceutical composition to be used in tissue regeneration 
will be determined by the attending physician considering 
various factors which modify the action of the proteins, 
e.g., amount of tissue weight desired to be formed, the site 

25 of damage, the condition of the damaged tissue, the size of a 
wound, type of damaged tissue (e.g., bone), the patient's 
age, sex, and diet, the severity of any infection, time of 
administration and other clinical factors. The dosage may 
vary with the type of matrix used in the reconstitution and 
with inclusion of other proteins in the pharmaceutical 
30 composition. For example, the addition of other known growth 
factors, such as IGF I (insulin like growth factor I), to the 
final composition, may also effect the dosage. Progress can 
be monitored by periodic assessment of tissue/bone growth 
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and/or repair, for example, X-rays, histomorphometric 
determinations and tetracycline labeling. 

Polynucleotides of the present invention can also be 
used for gene therapy. Such polynucleotides can be introduced 
5 either in vivo or ex vivo into cells for expression in a 
mammalian subject. Polynucleotides of the invention may also 
be administered by other known methods for introduction of 
nucleic acid into a cell or organism (including, without 
limitation, in the form of viral vectors or naked DNA) . 
Cells may also be cultured ex vivo in the presence of 
10 proteins of the present invention in order to proliferate or 
to produce a desired effect on or activity in such cells. 
Treated cells can then be introduced in vivo for therapeutic 
purposes. 

5.6.3. Effective Dosage 

15 Pharmaceutical compositions suitable for use in the 

present invention include compositions wherein the active 
ingredients are contained in an effective amount to achieve 
its intended purpose. More specifically, a therapeutically 
effective amount means an amount effective to prevent 
development of or to alleviate the existing symptoms of the 

20 subject being treated. Determination of the effective 

amounts is well within the capability of those skilled in the 
art, especially in light of the detailed disclosure provided 
herein. For any compound used in the method of the invention, 
the therapeutically effective dose can be estimated initially 
from cell culture assays. For example, a dose can be 

25 formulated in animal models to achieve a circulating 

concentration range that includes the IC 50 as determined in 
cell culture (i.e., the concentration of the test compound 
which achieves a half-maximal inhibition of the C-proteinase 
activity) . Such information can be used to more accurately 
determine useful doses in humans. 
30 A therapeutically effective dose refers to that amount 

of the compound that results in amelioration of symptoms or a 
prolongation of survival in a patient. Toxicity and 
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therapeutic efficacy of such compounds can be determined by 
standard pharmaceutical procedures in cell cultures or 
experimental animals, e.g., for determining the LD 50 (the 
dose lethal to 50% of the population) and the ED 50 (the dose 
5 therapeutically effective in 50% of the population) . The dose 
ratio between toxic and therapeutic effects is the 
therapeutic index and it can be expressed as the ratio 
between LD 50 and ED 50 . Compounds which exhibit high 
therapeutic indices are preferred. The data obtained from 
these cell culture assays and animal studies can be used in 
10 formulating a range of dosage for use in human. The dosage 
of such compounds lies preferably within a range of 
circulating concentrations that include the ED 50 with little 
or no toxicity. The dosage may vary within this range 
depending upon the dosage form employed and the route of 
administration utilized. The exact formulation, route of 
15 administration and dosage can be chosen by the individual 
physician in view of the patient's condition. See, e.g., 
Fingl et al . , 1975, in "The Pharmacological Basis of 
Therapeutics", Ch. l p. l. Dosage amount and interval may be 
adjusted individually to provide plasma levels of the active 
2o moiety which are sufficient to maintain the C-proteinase 

inhibiting effects, or minimal effective concentration (MEC) . 
The MEC will vary for each compound but can be estimated from 
in vitro data; for example, the concentration necessary to 
achieve 50-90% inhibition of the C-proteinase using the 
assays described herein. Dosages necessary to achieve the 
25 MEC will depend on individual characteristics and route of 
administration. However, HPLC assays or bioassays can be 
used to determine plasma concentrations. 

Dosage intervals can also be determined using MEC value. 
Compounds should be administered using a regimen which 
maintains plasma levels above the MEC for 10-90% of the time, 
30 preferably between 30-90% and most preferably between 50- 

90%. In cases of local administration or selective uptake, the 
effective local concentration of the drug may not be related 
to plasma concentration. 
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The amount of composition administered will, of course, 
be dependent on the subject being treated, on the subject's 
weight, the severity of the affliction, the manner of 
administration and the judgment of the prescribing physician. 

5 

5.6.4. Packaging 

The compositions may, if desired, be presented in a pack 
or dispenser device which may contain one or more unit dosage 
forms containing the active ingredient. The pack may, for 
example, comprise metal or plastic foil, such as a blister 
10 pack. The pack or dispenser device may be accompanied by 
instructions for administration. Compositions comprising a 
compound of the invention formulated in a compatible 
pharmaceutical carrier may also be prepared, placed in an 
appropriate container, and labelled for treatment of an 
indicated condition. 

15 

5.7. Antibodies 

Another aspect of the invention is an antibody that 
specifically binds the polypeptide of the invention. Such 
antibodies can be either monoclonal or polyclonal antibodies, 
as well fragments thereof and humanized forms or fully human 
^ forms, such as those produced in transgenic animals. The 
invention further provides a hybridoma that produces an 
antibody according to the invention. Antibodies of the 
invention are useful for detection and/or purification of the 
polypeptides of the invention. 

Protein of the invention may also be used to immunize 
^ animals to obtain polyclonal and monoclonal antibodies which 
specifically react with the protein. Such antibodies may be 
obtained using either the entire protein or fragments thereof 
as an immunogen. The peptide immunogens additionally may 
contain a cysteine residue at the carboxyl terminus, and are 
conjugated to a hapten such as keyhole limpet hemocyanin 
J0 (KLH) . Methods for synthesizing such peptides are known in 
the art, for example, as in R. P. Merrifield, J. Amer. Chem. 
Soc. 85, 2149-2154 (1963); J. L. Krstenansky, et al., FEBS 
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Lett. 211, 10 (1987) . Monoclonal antibodies binding to the 
protein of the invention may be useful diagnostic agents for 
the immunodetection of the protein. Neutralizing monoclonal 
antibodies binding to the protein may also be useful 
5 therapeutics for both conditions associated with the protein 
and also in the treatment of some forms of cancer where 
abnormal expression of the protein is involved. In the case 
of cancerous cells or leukemic cells, neutralizing monoclonal 
antibodies against the protein may be useful in detecting and 
preventing the metastatic spread of the cancerous cells, 

10 which may be mediated by the protein. In general, techniques 
for preparing polyclonal and monoclonal antibodies as well as 
hybridomas capable of producing the desired antibody are well 
known in the art (Campbell, A.M., Monoclonal Antibodies 
Technology: Laboratory Techniques in Biochemistry and 
Molecular Biology, Elsevier Science Publishers, Amsterdam, 
The Netherlands (1984); St. Groth et al., J. Immunol. 35:1-21 
(1990); Kohler and Milstein, Nature 256:495-497 (1975)), the 
trioma technique, the human B-cell hybridoma technique 
(Kozbor et al. r Immunology Today 4:72 (1983); Cole et al . , in 
Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, Inc. 

20 (1985) , pp. 77-96) . 

Any animal (mouse, rabbit, etc.) which is known to 
produce antibodies can be immunized with a peptide or 
polypeptide of the invention. Methods for immunization are 
well known in the art. Such methods include subcutaneous or 
intraperitoneal injection of the polypeptide. One skilled in 

^ the art will recognize that the amount of the protein encoded 
by the ORF of the present invention used for immunization 
will vary based on the animal which is immunized, the 
antigenicity of the peptide and the site of injection. The 
protein that is used as an immunogen may be modified or 
administered in an adjuvant in order to increase the 

^ protein's antigenicity. Methods of increasing the 

antigenicity of a protein are well known in the art and 
include, but are not limited to, coupling the antigen with a 
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heterologous protein (such as globulin or (3-galactosidase) or 
through the inclusion of an adjuvant during immunization. 

For monoclonal antibodies, spleen cells from the 
immunized animals are removed, fused with myeloma cells, such 
5 as sP2/0-Agl4 myeloma cells, and allowed to become monoclonal 
antibody producing hybridoma cells. Any one of a number of 
methods well known in the art can be used to identify the 
hybridoma cell which produces an antibody with the desired 
characteristics. These include screening the hybridomas with 
an ELISA assay, western blot analysis, or radioimmunoassay 

10 (Lutz et al., Exp. Cell Research. 275:109-124 (1988)). 

Hybridomas secreting the desired antibodies are cloned and 
the class and subclass is determined using procedures known 
in the art (Campbell, A.M., Monoclonal Antibody Technology: 
Laboratory Techniques in Biochemistry and Molecular Biology, 

is Elsevier Science Publishers, Amsterdam, The Netherlands 
(1984)). Techniques described for the production of single 
chain antibodies (U.S. Patent 4,946,778) can be adapted to 
produce single chain antibodies to proteins of the present 
invention. 

For polyclonal antibodies, antibody containing antiserum 
20 iS isolated from the immunized animal and is screened for the 
presence of antibodies with the desired specificity using one 
of the above-described procedures. The present invention 
further provides the above-described antibodies in delectably 
labeled form. Antibodies can be delectably labeled through 
the use of radioisotopes, affinity labels (such as biotin, 
25 avidin, etc.), enzymatic labels (such as horseradish 

peroxidase, alkaline phosphatase, etc.) fluorescent labels 
(such as FITC or rhodamine, etc.), paramagnetic atoms, etc. 
Procedures for accomplishing such labeling are well-known in 
the art, for example, see ( Sternberger , L.A. et al . , J. 
Histochem. Cytochem. 18:315 (1970); Bayer, E.A. et al., Meth. 
30 Enzym. 62:308 (1979); Engval, E. et al . , Immunol. 109:129 
(1972); Goding, J.w. J . Immunol. Meth. 13:215 (1976)). 
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The labeled antibodies of the present invention can be 
used for in vitro, in vivo, and in situ assays to identify 
cells or tissues in which a fragment of the polypeptide of 
interest is expressed. The antibodies may also be used 

5 directly in therapies or other diagnostics. The present 
invention further provides the above-described antibodies 
immobilized on a solid support. Examples of such solid 
supports include plastics such as polycarbonate, complex 
carbohydrates such as agarose and sepharose, acrylic resins 
and such as polyacrylamide and latex beads. Techniques for 

10 coupling antibodies to such solid supports are well known in 
the art (Weir, D.M. et al., "Handbook of Experimental 
Immunology" 4th Ed., Blackwell Scientific Publications, 
Oxford, England, Chapter 10 (1986); Jacoby, W.D. et al., 
Meth. Enzym. 34 Academic Press, N.Y. (1974)). The 

15 immobilized antibodies of the present invention can be used 
for in vitro, in vivo, and in situ assays as well as for 
immuno-af f inity purification of the proteins of the present 
invention. 



5.8. Computer Readable Sequences 

In one application of this embodiment, a nucleotide 
sequence of the present invention can be recorded on computer 
readable media. As used herein, "computer readable media" 
refers to any medium which can be read and accessed directly 
by a computer. Such media include, but are not limited to: 
magnetic storage media, such as floppy discs, hard disc 
storage medium, and magnetic tape; optical storage media such 
as CD-ROM; electrical storage media such as RAM and ROM; and 
hybrids of these categories such as magnetic/ optical storage 
media. A skilled artisan can readily appreciate how any of 
the presently known computer readable mediums can be used to 
create a manufacture comprising computer readable medium 
having recorded thereon a nucleotide sequence of the present 
invention. As used herein, "recorded" refers to a process for 
storing information on computer readable medium. A skilled 
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artisan can readily adopt any of the presently known methods 
for recording information on computer readable medium to 
generate manufactures comprising the nucleotide sequence 
information of the present invention. 

5 A variety of data storage structures are available to a 

skilled artisan for creating a computer readable medium 
having recorded thereon a nucleotide sequence of the present 
invention. The choice of the data storage structure will 
generally be based on the means chosen to access the stored 
information. In addition, a variety of data processor 

10 programs and formats can be used to store the nucleotide 
sequence information of the present invention on computer 
readable medium. The sequence information can be represented 
in a word processing text file, formatted in commercially- 
available software such as WordPerfect and Microsoft Word, or 
represented in the form of an ASCII file, stored in a 

15 database application, such as DB2 , Sybase, Oracle, or the 
like. A skilled artisan can readily adapt any number of 
dataprocessor structuring formats (e.g. text file or 
database) in order to obtain computer readable medium having 
recorded thereon the nucleotide sequence information of the 
present invention. By providing the nucleotide sequence of 

20 SEQ ID NO:l or a representative fragment thereof, or a 

nucleotide sequence at least 99.9% identical to SEQ ID NO:l 
in computer readable form, a skilled artisan can routinely 
access the sequence information for a variety of purposes. 
Computer software is publicly available which allows a 
skilled artisan to access sequence information provided in a 

25 computer readable medium. The examples which follow 

demonstrate how software which implements the BLAST (Altschul 
et al., J. Mol. Biol. 225:403-410 (1990)) and BLAZE (Brutlag 
et al., Comp. Chem. 27:203-207 (1993)) search algorithms on a 
Sybase system is used to identify open reading frames (ORFs) 
within a nucleic acid sequence. Such ORFs may be protein 
encoding fragments and may be useful in producing 
commercially important proteins such as enzymes used in 
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fermentation reactions and in the production of commercially 
useful metabolites. 

As used herein, "a computer-based system" refers to the 
hardware means, software means, and data storage means used 
5 to analyze the nucleotide sequence information of the present 
invention. The minimum hardware means of the computer-based 
systems of the present invention comprises a central 
processing unit (CPU) , input means, output means, and data 
storage means. A skilled artisan can readily appreciate that 
any one of the currently available computer-based systems are 
10 suitable for use in the present invention. As stated above, 
the computer-based systems of the present invention comprise 
a data storage means having stored therein a nucleotide 
sequence of the present invention and the necessary hardware 
means and software means for supporting and implementing a 
search means. As used herein, "data storage means" refers to 
15 memory which can store nucleotide sequence information of the 
present invention, or a memory access means which can access 
manufactures having recorded thereon the nucleotide sequence 
information of the present invention. 

As used herein, "search means" refers to one or more 
programs which are implemented on the computer-based system 
20 to compare a target sequence or target structural motif with 
the sequence information stored within the data storage 
means. Search means are used to identify fragments or 
regions of a known sequence which match a particular target 
sequence or target motif. A variety of known algorithms are 
disclosed publicly and a variety of commercially available 
25 software for conducting search means are and can be used in 
the computer-based systems of the present invention. 
Examples of such software includes, but is not limited to, 
MacPattern (EMBL) , BLASTN and BLASTA (NPOLYPEPTIDEIA) . A 
skilled artisan can readily recognize that any one of the 
available algorithms or implementing software packages for 
30 conducting homology searches can be adapted for use in the 
present computer-based systems. As used herein, a "target 
sequence" can be any nucleic acid or amino acid sequence of 
six or more nucleotides or two or more amino acids. A 
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skilled artisan can readily recognize that the longer a 
target sequence is, the less likely a target sequence will be 
present as a random occurrence in the database. The most 
preferred sequence length of a target sequence is from about 
5 10 to 100 amino acids or from about 30 to 3 00 nucleotide 
residues. However, it is well recognized that searches for 
commercially important fragments, such as sequence fragments 
involved in gene expression and protein processing, may be of 
shorter length. 

As used herein, "a target structural motif," or "target 
10 motif," refers to any rationally selected sequence or 

combination of sequences in which the sequence (s) are chosen 
based on a three-dimensional configuration which is formed 
upon the folding of the target motif. There are a variety of 
target motifs known in the art. Protein target motifs 
include, but are not limited to, enzyme active sites and 
15 signal sequences. Nucleic acid target motifs include, but 
are not limited to, promoter sequences, hairpin structures 
and inducible expression elements (protein binding 
sequences) . 

5-9. Triplex Helix Formation 

20 In addition, the fragments of the present invention, as 

broadly described, can be used to control gene expression 
through triple helix formation or antisense DNA or RNA, both 
of which methods are based on the binding of a polynucleotide 
sequence to DNA or RNA. Polynucleotides suitable for use in 
these methods are usually 20 to 40 bases in length and are 
25 designed to be complementary to a region of the gene involved 
in transcription (triple helix - see Lee et al . , Nucl. Acids 
Res. 6:3073 (1979); Cooney et al., Science 15241:456 (1988); 
and Dervan et al., Science 251:1360 (1991)) or to the mRNA 
itself (antisense - Olmno, J. Neurochem. 56:560 (1991) ; 
30 Oligodeoxynucleotides as Antisense Inhibitors of Gene 

Expression, CRC Press, Boca Raton, FL (1988)). Triple helix- 
formation optimally results in a shut-off of RNA 



transcription from DNA, while antisense RNA hybridization 
blocks translation of an mRNA molecule into polypeptide. 
Both techniques have been demonstrated to be effective in 
model systems. Information contained in the sequences of the 
5 present invention is necessary for the design of an antisense 
or triple helix oligonucleotide. 

5.10. Diagnostic Assays and Kits 

The present invention further provides methods to 
identify the presence or expression of one of the ORFs of the 
10 present invention, or homo log thereof, in a test sample, 
using a nucleic acid probe or antibodies of the present 
invention. 

In general, methods for detecting a polynucleotide of 
the invention can comprise contacting a sample with a 
compound that binds to and forms a complex with the 
15 polynucleotide for a period sufficient to form the complex, 
and detecting the complex, so that if a complex is detected, 
a polynucleotide of the invention is detected in the sample. 

Such methods can also comprise contacting a sample under 
stringent hybridization conditions with nucleic acid primers 
that anneal to a polynucleotide of the invention under such 
20 conditions, and amplifying annealed polynucleotides, so that 
if a polynucleotide is amplified, a polynucleotide of the 
invention is detected in the sample. 

In general, methods for detecting a polypeptide of the 
invention can comprise contacting a sample with a compound 
that binds to and forms a complex with the polypeptide for a 
25 period sufficient to form the complex, and detecting the 

complex, so that if a complex is detected, a polypeptide of 
the invention is detected in the sample. 

In detail, such methods comprise incubating a test sample 
with one or more of the antibodies or one or more of nucleic 
acid probes of the present invention and assaying for binding 
30 of the nucleic acid probes or antibodies to components within 
the test sample. 

Conditions for incubating a nucleic acid probe or 
antibody with a test sample vary. Incubation conditions 
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depend on the format employed in the assay, the detection 
methods employed, and the type and nature of the nucleic acid 
probe or antibody used in the assay. One skilled in the art 
will recognize that any one of the commonly available 
5 hybridization, amplification or immunological assay formats 
can readily be adapted to employ the nucleic acid probes or 
antibodies of the present invention. Examples of such assays 
can be found in Chard, T. , An Introduction to 
Radioimmunoassay and Related Techniques, Elsevier Science 
Publishers, Amsterdam, The Netherlands (1986); Bullock, G.R. 

*® et al., Techniques in Immunocytochemistry , Academic Press, 
Orlando, FLVol. 1 (1982), Vol. 2 (1983), Vol. 3 (1985); 
Tijssen, P., Practice and Theory of immunoassays: Laboratory 
Techniques in Biochemistry and Molecular Biology, Elsevier 
Science Publishers, Amsterdam, The Netherlands (1985) . The 

15 test samples of the present invention include cells, protein 
or membrane extracts of cells, or biological fluids such as 
sputum, blood, serum, plasma, or urine. The test sample used 
in the above-described method will vary based on the assay 
format, nature of the detection method and the tissues, cells 
or extracts used as the sample to be assayed. Methods for 

20 preparing protein extracts or membrane extracts of cells are 
well known in the art and can be readily be adapted in order 
to obtain a sample which is compatible with the system 
utilized. 

In another embodiment of the present invention, kits are 
provided which contain the necessary reagents to carry out 

25 the assays of the present invention. Specifically, the 
invention provides a compartment kit to receive, in close 
confinement, one or more containers which comprises: (a) a 
first container comprising one of the probes or antibodies of 
the present invention; and (b) one or more other containers 
comprising one or more of the following: wash reagents, 

30 reagents capable of detecting presence of a bound probe or 
antibody. 

In detail, a compartment kit includes any kit in which 
reagents are contained in separate containers. Such 



containers include small glass containers, plastic containers 
or strips of plastic or paper. Such containers allows one to 
efficiently transfer reagents from one compartment to another 
compartment such that the samples and reagents are not cross- 

5 contaminated, and the agents or solutions of each container 
can be added in a quantitative fashion from one compartment 
to another. Such containers will include a container which 
will accept the test sample, a container which contains the 
antibodies used in the assay, containers which contain wash 
reagents (such as phosphate buffered saline, Tris-buf f ers, 

10 etc.), and containers which contain the reagents used to 
detect the bound antibody or probe. Types of detection 
reagents include labeled nucleic acid probes, labeled 
secondary antibodies, or in the alternative, if the primary 
antibody is labeled, the enzymatic, or antibody binding 
reagents which are capable of reacting with the labeled 

15 antibody. One skilled in the art will readily recognize that 
the disclosed probes and antibodies of the present invention 
can be readily incorporated into one of the established kit 
formats which are well known in the art. 

5.11. Screening Assays 

20 Using the isolated proteins and polynucleotides of the 

invention, the present invention further provides methods of 
obtaining and identifying agents which bind to a protein 
encoded by the ORF from a nucleic acid with a sequence of SEQ 
ID N0:1, to a specific domain of the polypeptide encoded by 
the nucleic acid, or to a nucleic acid with a sequence of SEQ 

25 ID NO:l. In detail, said method comprises the steps of: 

(a) contacting an agent with an isolated protein 
encoded by an ORF of the present invention, or 
nucleic acid of the invention; and 

(b) determining whether the agent binds to said protein 
or said nucleic acid. 

30 in general, therefore, such methods for identifying 

compounds that bind to a polynucleotide of the invention can 
comprise contacting a compound with a polynucleotide of the 
invention for a time sufficient to form a 
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polynucleotide/compound complex, and detecting the complex, 
so that if a polynucleotide/compound complex is detected, a 
compound that binds to a polynucleotide of the invention is 
identified. 

5 Likewise, in general, therefore, such methods for 

identifying compounds that bind to a polypeptide of the 
invention can comprise contacting a compound with a 
polypeptide of the invention for a time sufficient to form a 
polypeptide /compound complex, and detecting the complex, so 
that if a polypeptide/compound complex is detected, a 

10 compound that binds to a polynucleotide of the invention is 
identified. 

Methods for identifying compounds that bind to a 
polypeptide of the invention can also comprise contacting a 
compound with a polypeptide of the invention in a cell for a 
time sufficient to form a polypeptide/compound complex, 

15 wherein the complex drives expression of a receptor gene 

seguence in the cell, and detecting the complex by detecting 
reporter gene sequence expression, so that if a 
polypeptide/compound complex is detected, a compound that 
binds a polypeptide of the invention is identified. 

Compounds identified via such methods can include 

20 compounds which modulate the activity of a polypeptide of the 
invention (that is, increase or decrease its activity, 
relative to activity observed in the absence of the 
compound) . Alternatively, compounds identified via such 
methods can include compounds which modulate the expression 
of a polynucleotide of the invention (that is, increase or 

25 decrease expression relative to expresssion levels observed 
in the absence of the compound) . Compounds, such as 
compounds identified via the methods of the invention, can be 
tested using standard assays well known to those of skill in 
the art for their ability to modulate activity/expression. 

The agents screened in the above assay can be, but are 

30 not limited to, peptides, carbohydrates, vitamin derivatives, 
or other pharmaceutical agents. The agents can be selected 
and screened at random or rationally selected or designed 
using protein modeling techniques. 
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For random screening, agents such as peptides, 
carbohydrates, pharmaceutical agents and the like are 
selected at random and are assayed for their ability to bind 
to the protein encoded by the ORF of the present invention. 
Alternatively, agents may be rationally selected or designed. 
As used herein, an agent is said to be "rationally selected 
or designed" when the agent is chosen based on the 
configuration of the particular protein. For example, one 
skilled in the art can readily adapt currently available 
procedures to generate peptides, pharmaceutical agents and 
the like capable of binding to a specific peptide sequence in 
order to generate rationally designed antipeptide peptides, 
for example see Hurby et al., Application of Synthetic 
Peptides: Antisense Peptides," In Synthetic Peptides, A 
User's Guide, W.H. Freeman, NY (1992), pp. 289-307, and 
Kaspczak et al . , Biochemistry 28:9230-8 (1989), or 
pharmaceutical agents, or the like. 

In addition to the foregoing, one class of agents of the 
present invention, as broadly described, can be used to 
control gene expression through binding to one of the ORFs or 
EMFs of the present invention. As described above, such 
agents can be randomly screened or rationally 

designed/ selected. Targeting the ORF or EMF allows a skilled 
artisan to design sequence specific or element specific 
agents, modulating the expression of either a single ORF or 
multiple ORFs which rely on the same EMF for expression 
control. One class of DNA binding agents are agents which 
contain base residues which hybridize or form a triple helix 
formation by binding to DNA or RNA. Such agents can be based 
on the classic phosphodiester, ribonucleic acid backbone, or 
can be a variety of sulfhydryl or polymeric derivatives which 
have base attachment capacity. 

Agents suitable for use in these methods usually contain 
20 to 40 bases and are designed to be complementary to a 
region of the gene involved in transcription (triple helix - 
see Lee et al., Nucl. Acids Res. 6:3073 (1979); Cooney et 
al., Science 241:456 (1988); and Dervan et al., Science 
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252:1360 (1991)) or to the hlRNA itself (antisense - Okano, J. 
Neurochem. 56:560 (1991); Oligodeoxynucleotides as Antisense 
Inhibitors of Gene Expression, CRC Press, Boca Raton, FL 
(1988)). Triple helix- formation optimally results in a 
shut-off of RNA transcription from DNA, while antisense RNA 
hybridization blocks translation of an mRNA molecule into 
polypeptide. Both techniques have been demonstrated to be 
effective in model systems. Information contained in the 
sequences of the present invention is necessary for the 
desiqn of an antisense or triple helix oligonucleotide and 
other DNA binding agents. Agents which bind to a protein 
encoded by one of the ORFs of the present invention can be 
used as a diagnostic agent, in the control of bacterial 
infection by modulating the activity of the protein encoded 
by the ORF. Agents which bind to a protein encoded by one of 
the ORFs of the present invention can be formulated using 
known techniques to generate a pharmaceutical composition. 

5.10. Use of Nucleic Acids as Probes 

Another aspect of the subject invention is to provide 
for polypeptide-specif ic nucleic acid hybridization probes 
capable of hybridizing with naturally occurring nucleotide 
sequences. The hybridization probes of the subject invention 
may be derived from the nucleotide sequence of the SEQ ID 
NO:l. Because the corresponding gene is only expressed in a 
limited number of tissues, especially adult tissues, a 
hybridization probe derived from SEQ ID N0:1 can be used as 
an indicator of the presence of RNA of cell type of such a 
tissue in a sample. 

Any suitable hybridization technique can be employed, 
such as, for example, in situ hybridization. PCR as described 
US Patent Nos 4,683,195 and 4,965,188 provides additional 
uses for oligonucleotides based upon the nucleotide 
sequences. Such probes used in PCR may be of recombinant 
origin, may be chemically synthesized, or a mixture of both. 
The probe will comprise a discrete nucleotide sequence for 
the detection of identical sequences or a degenerate pool of 



possible sequences for identification of closely related 
genomic sequences . 

Other means for producing specific hybridization probes 
for nucleic acids include the cloning of nucleic acid 
5 sequences into vectors for the production of mRNA probes. 
Such vectors are known in the art and are commercially 
available and may be used to synthesize RNA probes in vitro 
by means of the addition of the appropriate RNA polymerase as 
T7 or SP6 RNA polymerase and the appropriate radioactively 
labeled nucleotides. The nucleotide sequences may be used to 
10 construct hybridization probes for mapping their respective 
genomic sequences. The nucleotide sequence provided herein 
may be mapped to a chromosome or specific regions of a 
chromosome using well known genetic and/or chromosomal 
mapping techniques. These techniques include in situ 
hybridization, linkage analysis against known chromosomal 
15 markers, hybridization screening with libraries or flow- 
sorted chromosomal preparations specific to known 
chromosomes, and the like. The technique of fluorescent in 
situ hybridization of chromosome spreads has been described, 
among other places, in Verma et al (1988) Human Chromosomes: 
A Manual of Basic Techniques, Pergamon Press, New York NY. 
20 Fluorescent in situ hybridization of chromosomal 

preparations and other physical chromosome mapping techniques 
may be correlated with additional genetic map data. Examples 
of genetic map data can be found in the 1994 Genome Issue of 
Science (265:1981f). Correlation between the location of a 
nucleic acid on a physical chromosomal map and a specific 
25 disease (or predisposition to a specific disease) may help 
delimit the region of DNA associated with that genetic 
disease. The nucleotide sequences of the subject invention 
may be used to detect differences in gene sequences between 
normal, carrier or affected individuals .The nucleotide 
sequence may be used to produce purified polypeptides using 
30 well known methods of recombinant DNA technology. Among the 
many publications that teach methods for the expression of 
genes after they have been isolated is Goeddel (1990) Gene 
Expression Technology, Methods and Enzymology, Vol 185, 
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Academic Press, San Diego. Polypeptides may be expressed in 
a variety of host cells, either prokaryotic or eukaryotic. 
Host cells may be from the same species from which a 
particular polypeptide nucleotide sequence was isolated or 
5 from a different species. Advantages of producing 

polypeptides by recombinant DNA technology include obtaining 
adequate amounts of the protein for purification and the 
availability of simplified purification procedures. 

Each sequence so obtained was compared to sequences in 
GenBank using a search algorithm developed by Applied 
10 Biosystems and incorporated into the INHERIT™ 670 Sequence 
Analysis System. In this algorithm, Pattern Specification 
Language (developed by TRW Inc., Los Angeles, CA) was used to 
determine regions of homology. The three parameters that 
determine how the sequence comparisons run were window size, 
window offset, and error tolerance. Using a combination of 
15 these three parameters, the DNA database was searched for 
sequences containing regions of homology to the query 
sequence, and the appropriate sequences were scored with an 
initial value. Subsequently, these homologous regions were 
examined using dot matrix homology plots to distinguish 
regions of homology from chance matches. Smith-Waterman 
20 alignments were used to display the results of the homology 
search. Peptide and protein sequence homologies were 
ascertained using the INHERIT" 670 Sequence Analysis System in 
a way similar to that used in DNA sequence homologies. 
Pattern Specification Language and parameter windows were 
used to search protein databases for sequences containing 
25 regions of homology that were scored with an initial value. 
Dot-matrix homology plots were examined to distinguish 
regions of significant homology from chance matches. 

Alternatively, BLAST, which stands for Basic Local 
Alignment Search Tool, is used to search for local sequence 
alignments (Altschul SF (1993) J Mol Evol 36:290-300; 
30 Altschul, SF et al (1990) J Mol Biol 215:403-10). BLAST 
produces alignments of both nucleotide and amino acid 
sequences to determine sequence similarity. Because of the 
local nature of the alignments, BLAST is especially useful in 
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determining exact matches or in identifying home-logs. 
Whereas it is ideal for matches which do not contain gaps, it 
is inappropriate for performing motif-style searching. The 
fundamental unit of BLAST algorithm output is the High- 
5 scoring Segment Pair (HSP) . An HSP consists of two sequence 
fragments of arbitrary but equal lengths whose alignment is 
locally maximal and for which the alignment score meets or 
exceeds a threshold or cutoff score set by the user. The 
BLAST approach is to look for HSPs between a query sequence 
and a database sequence, to evaluate the statistical 
10 significance of any matches found, and to report only those 
matches which satisfy the user-selected threshold of 
significance. The parameter E establishes the statistically 
significant threshold for reporting database sequence 
matches. E is interpreted as the upper bound of the expected 
frequency of chance occurrence of an HSP (or set of HSPs) 
15 within the context of the entire database search. Any 

database sequence whose match satisfies E is reported in the 
program output. 

In addition, BLAST analysis was used to search for 
related molecules within the libraries of the LIFESEQ" 
database. This process, an "electronic northern" analysis is 
20 analogous to northern blot analysis in that it uses one 
cellubrevin sequence at a time to search for identical or 
homologous molecules at a set stringency. The stringency of 
the electronic northern is based on "product score". The 
product score is defined as (% nucleotide or amino acid 
[between the query and reference sequences] in Blast 
25 multiplied by the % maximum possible BLAST score [based on 
the lengths of query and reference sequences]) divided by 
100. At a product score of 40, the match will be exact 
within a 1-2% error; and at 70, the match will be exact. 
Homologous or related molecules can be identified by 
selecting those which show product scores between 
30 approximately 15 and 30. 

The present invention is illustrated in the following 
examples. Upon consideration of the present disclosure, one 
of skill in the art will appreciate that many other 
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embodiments and variations may be made in the scope of the 
present invention. Accordingly, it is intended that the 
broader aspects of the present invention not be limited to 
the disclosure of the following examples. 

6. EXAMPLE: Identification of Novel CD-39-LiJce 

Nucleic Acid and Polynucleotide Molecules 

Described herein is the cloning and characterization of 
novel CD-39-like nucleotide-triphosphatase ("NTPase") gene 
and polypeptide sequences. These sequences are referred to 
below as "mNTPase" , "mCD39L4" and "CD39L2." 

First, a novel murine family member was cloned by low 
stringency screening of mouse cDNA libraries with a human 
CD39L1 cDNA clone (Chadwick, B.P. & Frischauf A.-M., 1997, 
Mamm. Genome 8:668-672). A 1738 bp cDNA clone was isolated 
from an adult mouse testis cDNA library (Stratagene Ltd. , 
Cambridge, UK) and sequenced. DNA sequence comparisons with 
the human CD39L1 cDNA sequence showed moderate DNA homology 
of approximately 39% identity) . An open reading frame (ORF) 
could be detected for the cNDA sequence, but indicated that 
the cDNA clone did not contain the initiation methionine 
codon and, therefore, did not extend to the 5» end. Database 
searches with the mouse cDNA sequence identified two mouse 
EST clones that extended the cDNA sequence at the 5' end 
(Accession Nos. AA116990 and AA12 07 57) . The EST clones were 
resequenced. The cloned and the resequenced nucleotide 
sequences were analyzed and were combined appropriately to 
yield the nucleotide sequence (SEQ ID NO: 7) depicted in FIG. 
1, and referred to herein as mCD39L4 or mNTPase. The 
sequence revealed an ORF from nucleotides 2 05 to 1599 with 
the ATG at nucleotide 205 having a moderate match to the 
initiation start site for vertebrates (AAGAAUAUGG for mNTPase 
versus GCCGCCAUGG; Kozak, M. , 1989, J. Biol. Chem. 108:229- 
241). The derived amino acid sequence is also shown in FIG. 
2 (SEQ ID NO:8). No apparent polyadenylation signal existed, 
although the cDNA clone isolated contained a poly-A tail. 



Hydropathy plots with Topred-II 1.1 (Claros, M.G. & Von 
Hejine, G. , 1994, Comput. Appl. Biosci. 10:685-686) predict a 
single potential transmembrane segment close to the amino 
terminus of the protein, suggesting a single-pass 
5 transmembrane protein with a large extracellular domain. Two 
potential glycosylation sites can be found at amino acid 
positions 41 (NVSA) and at 231 (NSTF) , suggesting that 
mNTPase is glycosylated. 

Database searches with the derived amino acid sequence 
identified homology with other members of the NTPase family. 
10 FIG. 2 shows an alignment of the full mNTPase (mCD39L4) 

protein sequence against three of the most homologous known 
NTPases, from garden pea, potato and Saccharomyces 
cer&visiae. The mNTPase protein shares approximately 30% 
amino acid identity with the three other NTPases. 

The region of highest homology between all members of 
15 the NTPase family is at the amino terminus of the protein. 
Handa & Guidotti (Handa, M. & Guidotti, G. , 1996, Biochem. 
Biophys. Res. Commun. 218 :916-923) highlighted four regions 
of NTPases referred to as putative apyrase-conserved regions 
("ACRs") . FIG. 3 shows an alignment of ACRs I-IV. (See 
Section 3, Section 5, and its subsections, above, for a 
20 delineation of the amino acid residues that make up ACRs I-IV 
of the CD39-like polypeptides of the invention.) ACR 
conservation would indicate that these regions are essential 
for the functioning of the protein, while changes in the 
regions surrounding these domains can be tolerated. The 
presence of all four ACRs in the mNTPase (mCD39L2) indicates 
that mNTPase is a new member of the NTPase family. 

BLAST searches with the DNA sequence of mNTPase 
(mCD39L4) revealed two overlapping human EST clones with 57% 
DNA sequence identity to portions of mNTPase (Accession Nos. 
H08436 and AA378537) . Upon combination and analysis of the 
resulting sequence, an ORF was identified that showed 
>0 homology to NTPases. The putative NTPase protein sequence, 
referred to herein as "CD39L2," is shown in FIG. 3 alongside 
the other NTPase protein sequences. The identification and 
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characterization of the full-length CD39L2 polypeptide and 
nucleotide sequences is described in the Example presented in 
Section 7, below. 

To map the murine mNTPase gene, a cosmid was isolated 
5 from a mouse cosmid library, and used for fluorescence in 
situ hybridization (FISH). For the FISH analysis, slides 
with mouse metaphase chromosomes were prepared from spleens 
as described in Monier et al. (Monier, K. et al., 1996, 
Cytogenet. Cell Genet. 72:200-204). 1 microgram of mouse 
cosmid containing the mNTPase gene was labeled with biotin 
10 14-ATP and a Bionick Kit (GibcoBRL) . DNA was purified by 
passage through a Sephadex G50 column and ethanol 
precipitated with 50 micrograms of sheared salmon sperm DNA 
and tRNA. 80 ng of probe was dried down with 3 micrograms of 
mouse Cot-1 DNA (GibcoBRL) . Hybridization was carried out as 
described (Ragoussis, J. et al . , 1992, Genomics 14:423-430). 
15 Confirmation of chromosomal location was achieved by 

rehybridizing the same slide with a mouse Chromosome 12- 
specif ic Starfish Paint (Cambio) . 

The FISH study revealed the presence of mNTPase on mouse 
Chr. 12 at chromosome band E. To confirm the location of the 
mNTPase gene on mouse Chr. 12, linkage analysis was carried 
20 out. upon the European Collaborative Interspecific Backcross 
(EUCIB) . PCR primers were designed to the 3' untranslated 
region of the mNTPase cDNA sequence and used for PCR by use 
of mouse genomic DNA from the two parental mouse strains, Mus 
spretus and C57BL/6. A polymorphism was detected between the 
two strains by SSCP analysis and was used for the mapping. 
25 (PCR conditions: 48°C 20 sec, Primer 1: 

CCAGACTGTAAATCTTTTGG; Primer 2: AGGGAATGTAATAAGGGTAG ; 
conditions: 94°C 2 minutes; 35 cycles of 94°C 20 sec. 72°C 
20 sec, 72°C 1 min; product size: 320 bp). 

Linkage with a LOD score of 8.14 was obtained with the 
genetic marker D12Mit4, flanked by D12Mitl49 and D12Mit238, 
30 between 31.7 cM and 3 6.1 cM from the top of the mouse Chr. 12 
linkage group for the MIT F 2 Intercross. This region of mouse 
Chr. 12 has previously been shown to share synteny with human 
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Chr. 14q (DeBry, R.W. & Seldin, M.F., 1996, Genomics 33:337- 
351) . 



5 7. EXAMPLE: Identification and Characterization of 
Additional Novel CD39-Like Polypeptides 
and Nucleic Acid Molecules 

Described herein is the cloning and 
characterization of novel CD-39-like nucleotide- 
triphosphatase ("NTPase") gene and polypeptide sequences. 
10 These sequences are referred to below as "CD39L2," "CD39L3," 
"CD39L4" and "dCD39L4." 



7.1. Materials and Methods 

Identification, isolat ion, and sequencing of cDNA clones 
for CD39L2. CD39L3 . and CD39T.4. The nucleotide sequence of 
CD39 (Accession No. S73813) , CD39L1 (Accession No. U91510) , 
and mNTPase (see Section 6, above) were used in TBLASTX 
searches against entries in the expressed sequence tag (EST) 
database at EMBL/ GenBank, using the Bork server 
(http://www.bork.embl-heidelberg.de/). cDNA clones for 
homologous IMAGE EST entries were obtained from the Human 
Genome Mapping Project Resource Centre (HGMP, Hinxton, UK) . 
DNA was prepared with QiaTip-100 (Qiagen) , and the cDNA was 
sequenced by primer walking with a fluorescence labeled dye- 
terminator cycle sequencing kit according to the 
manufacturer's instructions (PRISM Ready Dye-Deoxy Terminator 
Premix from Applied Biosystems Inc.) and electrophoresed on 
an ABI 373 (Perkin-Elmer) . Overlapping EST clones were 
identified by searching with the nucleotide sequence against 
entries in the EST database using BLAST-N 
( http : / /www . ncbi . nlm . nih . gov : 8 0/ cgi-bin/BLAST / nph- 
blast?Jf orm) . 

Additional IMAGE cDNA clones were ordered from HGMP if 
they extended the existing nucleotide sequence further 5'. 
cDNA clones corresponding to the most 5 ' extreme of each gene 
were identified by hybridization of radiolabeled inserts of 
IMAGE cDNA clones to a keratinocyte stem cell cDNA library, a 



human adult breast epithelial cDNA library constructed using 
Stratagene Lambda ZAP vector, and a Jurkat cell line cDNA 
library in pBluescript (Dunne, J. et al., 1995, Genomics 
30:207-223) . 

5 Northern analysis of members of the CD39-like gene 

family. cDNA. clone inserts were removed by restriction 
digestion and separated by gel electrophoresis. Insert DNA 
was gel-purified and radiolabeled (Sambrook et al., 1989, 
supra ) . Radiolabeled cDNA was prehybridized at 65°C for 2 h 
with 20 Mg of human Cot-1 DNA (GibcoBRL) and 100 of total 

10 human DNA (Sigma) , before hybridization to Northern blots 
(Clontech, human multiple tissue Northern blots, Catalog No. 
7760-1 and 7759-1) according to the manufacturer's 
instructions . 

Mapping of CD3 9L2. CD39L3. and CD39L4. Members of the 
CD3 9-like gene family were mapped in the human genome by PCR 

15 screening of the GeneBridge-4 radiation jhbrid mapping panel 
obtained from the HGMP Resource Centre (Hinxton, UK) (Gyapay, 
G. et al., 1996, Hum. Mol. Genet. 5:339-346). PCR-positive 
radiation hybrid clones were organized into the GeneBridge-4 
HGMP-RC subset order using the HGMP radiation hybrid mapping 
World Wide Web (WWW) site (http://www.hgmp.mrc.ac.uk/cgi- 

20 bin/contig/contig/rhmapper.pl) , and mapping data for each 
gene were obtained from the Whitehead server 
(http://www.genome.wi.mit.edu/cgi-bin/rhmapper.pl) . The 
chromosomal location for each gene was confirmed by PCR 
screening of the monochromosomal hybrids obtained from the 
HGMP Resource Centre. PCR primers were designed for the 3» 

25 untranslated region (UTR) of each gene and titrated for a 
unique human-specific PCR product. PCR conditions: CD39L2, 
Primer 1 , 5 ' -CTGCTTGAGTGACGTCTCTG-3 ' ; Primer 2 , 5 • - 
CACATGAGGTTCAGCTCGTG-3 ' ; 94°C for 2 min; 38 cycles of 94°C 
for 20 s, 54°C for 20 s, 72°C for 20 s; 72°C for 2 min. 
Product size is 362 bp). CD39L3, Primer 1: 5'- 

30 GTGAAGTGGCTGCCTTCAGG-3 * ; Primer 2, 5 * -CCTTTGACTCGGGACTCCAG- 
3*; 94°C for 2 min; 38 cycles of 94°C for 20 s, 56°C for 20 s, 
72°C for 2 min. Product size is 281 bp). CD39L4. Primer 1, 
5 • — GAACTGCTGCCTAACCACTC-3 * ; Primer 2 , 5 1 - 
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ATTGATGGGTCTTGGGATTGC-3 ' ; 94°C 2 for min; 38 cycles of 94 °C 
for 20 s, 56°c for 20 s; 72°c for 20 s; 72°c for 2 min. 
Product size is 234 bp. PCR products were analyzed by 
electrophoresis through 3.5% NuSieve agarose gels (Flowgen) . 

5 

7.2. Results 

Isolation and Sequence Characterization of CD39L2. 
Identification of partial human CD39L2 sequence was described 
in the Example presented in Section 6, above. The CD39L2 
insert was used to isolate additional clones from a human 

10 adult breast epithelial cDNA library (ZR75) , a human T- 
leukemia cell line J6 cDNA library (Jurkat) , and a human 
keratinocyte stem cell cDNA library (KER) . Of 2 3 cDNA clones 
that were isolated and sequenced, all but one appeared to be 
alternatively spliced or unspliced. Within the 2762 bp cDNA 
that appeared to be neither unspliced or alternatively 

15 spliced, an ORF extending to nucleotide 1600 containing ACRs 
I-IV was identified. Two ATG codons with a poor match to the 
consensus translation initiation site were found at 
nucleotide positions 148 and 232 ( AUGUG AAUG A at 148 and 
ACAAGGAUGA at 232 versus consensus GCCGCCAUGG; Kozak, M. , 
1989, J. Biol. Chem. 108 : 229-241) . Based on homology to 

20 mNTPase, the ATG at nucleotide position 232 is the initiation 
codon. (See FIG. 9 for a depiction of the CD39L2 amino acid 
sequence that results from translation from the upstream, 
position 148, start codon; such a form of CD39L2 as well as 
nucleotide sequences that encode this form of the polypeptide 
are also intended to be included as part of the present 

25 invention.) A single polyadenylation signal of AAUAAA was 
identified at nucleotide position 2700, 22 nucleotides 5' of 
the poly (A) tail of the human CD39L2 cDNA. 

The nucleotide sequence (SEQ ID N0:1) and derived amino 
acid sequence (SEQ ID NO: 2) of human CD39L2 is depicted in 
FIG. 4. Hydrophobicity plots using Topred-II 1.1 (Claros, 

30 M.G. & Von Hejine, G., 1994, Comput. Appl. Biosci. 10:685- 
686) predicted a single transmembrane segment at the N- 
terminal extreme of the protein, suggesting that CD39L2 has a 
short putative cytoplasmic tail and a large extracellular C- 
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terminal domain (FIG. 5) . There are two potential N- 
glycosylation sites in the predicted extracellular domain. A 
CAMP and cGMP-dependent protein kinase and a protein kinase-C 
phosphorylation site are found directly after the initiation 
5 methionine codon (nucleotide 2 32) . 

Isolation and Sequence Characterization of CD39L3. An 
additional novel gene and polypeptide CD39-like sequence, 
referred to herein as CD39L3, was also identified. BLASTN 
search of the NCBI EST database with the full cDNA sequence 
for human CD39 (Accession No. S73813) identified three EST 

10 entries for cDNA clones from an endometrial tumor library 

(Accession Nos. AA336644, AA338117, and AA337885) , which led 
to the identification of an IMAGE library EST (Accession No. 
N72742) that was completely sequenced and had amino acid 
homology to CD39 and CD39L1. The insert of N72742 was used 
to screen the Jurkat, ZR75, and KER cDNA libraries, and a 

15 single cDNA clone was isolated from the KER library and 
sequenced. 

A 1669-bp ORF was identified within the cDNA insert. 
The nucleotide sequence (SEQ ID NO: 3) and derived amino acid 
sequence (SEQ ID NO: 4) of the cDNA insert, referred to herein 
as CD39L3, are shown in FIG. 6. The amino acid sequence was 

20 revealed to contain ACRs I-IV, an ATG codon at position 83, 
and a single polyadenylation signal at position 2758. 
Hydrophobicity plots as described above predict two potential 
transmembrane segments at the N and C-terminal extremes of 
the protein (FIG. 5) . There are seven potential 
extracellular N-glycosylation sites. A cAMP- and cGMP- 

25 dependent protein kinase site and a protein kinase-C 

phosphorylation site are located at the C-terminal extreme of 
the protein. 

Isolation and Sequence Characterization of CD39L4. An 
additional CD39-like gene and polypeptide sequence, referred 
to herein as CD39L4, was also identified. A TBLASTX search 
30 of the NCBI EST database with the full cDNA sequence for the 
mNTPase was performed. A human EST clone was sequenced, and 
an ORF was identified extending to nucleotide 529 of 2260 
nucleotides that contained ACR I only and an ATG codon at 
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position 256. In the same reading frame, downstream of the 
stop codon at nucleotide 529, an ORF extending to nucleotide 
1792 contained ACRs II, III, and IV. Further analysis of the 
nucleotide sequence revealed a putative intron with splice 

5 donor and acceptor sites that conform to the 5 ' gt 3 ' ag 

rule (Breathnach and Chambon, 1981, Ann Rev. Biochem. 50: 349- 
383; splice donor CAGgtcacttatggagcctg at nucleotide 
position 470, acceptor ccatggacaaaatagGAC at position 710, 
exon sequence underlined) . Further analysis of the sequence 
revealed that removal of the 251-bp putative intron would 

10 result in a contiguous ORF containing ACRs I-IV. The 
hypothesis that this sequence does indeed constitute an 
intron was only confirmed by isolation and sequencing of 
three additional cDNA clones (CD39LAJ1-3) from the Jurkat 
library, one of which contained the 251 bp. 

The nucleotide sequence (SEQ ID NO: 5) and derived amino 

15 acid sequence (SEQ ID N0:6), referred to herein as CD39L4, is 
depicted in FIG. 7. The sequence contained a poly (A) tail, 
but no consensus polyadenylation sequence (Proudfoot, 1991, 
New Biol. 3:851-854). This is also the case for mNTPase. 
Hydrophobicity plots as described above predicted a single 
transmembrane segment at the N-terminal extreme of the 

20 protein (Fig. 5) . This is similar to the predicted topology 
of CD39L2 and different from that of CD39, Cd39Ll, and 
CD39L3. There are three potential extracellular N- 
glycosylation sites. 

FIG. 8 depicts an alignment of each of the above- 
described sequences, along with other members of the NTPase 

25 CD-39-like gene family. The ACR domains are indicated by 
arrows . 

Expression of the Human Members of the CD39-Like Family. 

Representative probes for each member of the CD39-like 
gene family were hybridized to human multiple tissue Northern 
30 blots. Results are presented below. 

CD39L2. Hybridization of the CD39L2 cDNA insert to the 
multiple tissue Northern blots resulted in two prominent 
signals of 2.6 kb (major) and 4.4 kb (minor) in all tissues 
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studied. This is most likely due to differential 
polyadeny lat ion . 

CD39L3 . A PCR product covering the coding sequence of 
CD39L3 was used for the Northern hybridizations. A strong 

5 signal of approximately 3.0 kb could be seen in adult brain, 
pancreas, spleen, and prostate. Though moderate or low 
expression was seen in most other tissues, no signal was 
detected in adult liver and peripheral blood leukocytes. A 
weaker signal of approximately 1.8 kb was found in adult 
pancreas and may be the result of alternative splicing. 

10 CD39L4 . The CD39L4 cDNA was hybridized to the same 

Northern blots, and a prominent signal of approximately 4.8 
kb was seen in adult liver, kidney, prostate, testis, and 
colon. Considerably weaker expression was seen for all other 
tissues examined. Several smaller bands were observed in 
tissues showing the strongest expression of CD39L4 and may be 

15 the result of differential polyadeny lation or alternative 
splicing. 

Mapping of Members of the CD39-like Family. 
The CD39L2 gene was mapped with a lod score of >19 to 
human chromosome 20 by PCR typing of the GeneBridge 4 
20 Radiation Hybrid Mapping Panel (Gyapay et al., 1996, Human 
Mol. Genet. 5:339-346). CD39L2 mapped 9.76 cR from D20S493 
(typing data: 12012 02101 22000 00111 00110 01210 00110 
01101 10121 00100 00120 11211 00011 11012 01001 01102 00000 
00000 001) . Using the closet flanking markers (D20S184 and 
D2 0S99) also represented on the consensus map, this placed 
25 CD39L2 at chromosome band 20qll.2. The location of CD39L2 on 
human chromosome 20 was confirmed by PCR analysis of 
monochromosomal mapping panels (Kelsell et al., 1995, Ann. 
Human Genet. 59:233-241). On the basis of synteny to human 
chromosome 20qll.2, the mouse homolog of CD39L2 was expected 
to map to mouse chromosome 2 (DeBry and Seldin, 1996, 
30 Genomics 33:337-351). 

The CD39L3 gene was mapped as described above to human 
chromosome 3, 5.76 cR from D3S3390 (data: 12002 02010 22000 
00011 20000 00110 01001 00000 02022 11000 10001 00200 21100 
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00212 01010 10002 00000 00011 001) . Using the closest 
flanking markers as described above (D3S1561 and D3S3564) , 
this placed CD39L3 at chromosome band 3p21.3. The location 
of CD39L3 on chromosome 3 was confirmed by PCR as for CD39L2. 

5 On the basis of synteny, the mouse homologue of Cd39L3 was 
expected to map to mouse chromosome 9 (DeBry and Seldin, 
1996, Genomics 33:337-351) . 

The CD39L4 gene was mapped as described above to human 
chromosome 14, 1.92 cR from D14S71 (data: 02102 02102 22000 
01010 11021 01000 01010 10110 02121 21000 00010 00211 01001 

10 10102 02012 00002 12111 01100 002). This placed CD39L4 at 
chromosome band 14q24. The chromosomal location of CD39L4 
was confirmed as described above. 

Identification of a Drosophila Gene with high Homology 
to CD39L2 and CD39L4. A D. melanogaster CD39-like gene was 

15 also identified. A TBLASTX search of the EST database using 
the human CD39L2 cDNA sequence, five Drosophila EST entries 
were identified (Accession No. AA391695, AA390461, AA201196, 
AA246996, and AA567512) . A consensus sequence was generated 
and used for a BLASTN search against EMBL/GenBank entries. A 
single D. melanogaster genomic entry (Accession No. AC002032) 

20 was identified showing 100% sequence identity to three 

regions of the EST consensus sequence. Alignment of the EST 
consensus against the genomic sequence identified three exons 
that conform to the 5' gt...3' ag rule (Breathnach & Chambon, 
1981, Ann. Rev. Biochem. 50:349-383). Exon 4 was identified 
on the basis of reading frame homology to the CD39L2 and 

25 CD39L4 proteins. An ATG codon was identified in exon 1, a 
stop codon in exon 4. 

The predicted amino acid sequence of the D. melanogaster 
CD39-like gene, referred to herein as dCD39L4, containing the 
ACRs-l-IV was shown in FIG. 9, aligned against the gene 
family members with the highest homology. Three N- 

30 glycosylation consensus sites were found in the putative 
extracelluar domain, and two potential cAMP- and c-GMP- 
dependent protein kinase phosphorylation sites were found in 
the putative N-terminal cytoplasmic domain. Hydrophobicity 
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plots as described above predicted a single transmembrane 
segment at the N-terminal extreme of the dCD3 9L4 protein. 
The topology of dCD39L4 is therefore most similar to the 
predicted topology of the CD39L2 and Cd39L4 proteins. 

5 

The present invention is not to be limited in scope by 
the exemplified embodiments which are intended as 
illustrations of single aspects of the invention, and 
compositions and methods which are functionally equivalent 
are within the scope of the invention. Indeed, numerous 

10 modifications and variations in the practice of the invention 
are expected to occur to those skilled in the art upon 
consideration of the present preferred embodiments. 
Consequently, the only limitations which should be placed 
upon the scope of the invention are those which appear in the 
appended claims. 

15 All references cited within the body of the instant 

specification are hereby incorporated by reference in their 
entirety. 
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